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Brief Reports 


The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 


page and prepared according to the specifications 
given below. 


2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 


4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 


Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 

To insure that the Brief Report will be no 
longer than one printed page, its typescript, 
including all matter except the title and the 
author’s lines, must not exceed 75 lines av- 


eraging 42 characters and spaces in length. 
Set the typewriter margins for short lines of 
42 characters, which are 3.5 inches long in 
elite typing, and 4.2 inches long in pica. 

The manuscript of the Brief Report must 
be double spaced throughout. Except for its 
short lines, it follows the standard style (1). 
Headings, tables, and references are avoided 
or, if essential, must be counted in the 75 
lines. Each Brief Report must be accom- 
panied by a footnote in the style below, 
which is typed on a separate sheet and not 
counted in the 75-line quota: * 


1An extended report of this study may be ob- 
tained without charge from John Doe, 300 Market 
St., Prospect 6, Mass. (giving the author’s full name 
and address), or for a fee from the American Docu- 
mentation Institute. Order Document No. ——, re- 
mitting $—— for microfilm or $—— for photo- 
copies. 


Extended report. Because the extended re- 
port is intended for photoduplication, and is 
not copy to be sent to a printer, its style 
should differ in several ways from that of 
other manuscripts: (a) The extended report 
should be typed with single spacing for 
economy in duplication. (b) Tables and fig- 
ures should be placed adjacent to the text 
which refers to them. A caption should be 
typed below each figure. (c) Footnotes should 
be typed at the bottom of the page on which 
reference is made to them. In other respects, 
the full report is prepared in the style speci- 
fied by the Publication Manual (1). 


Reference 


1. American Psychological Association. Council of 
Editors. Publication manual of the American 
Psychological Association (1957 rev.). Wash- 
ington, D. C.: American Psychological Asso- 
ciation, 1957. 
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Further Evaluation of the Archimedes Spiral Aftereffect 


Harold Gilberstadt, Jerome D. Schein 


VA Hospital, Minneapolis, and University of Minnesota 


and Albert Rosen 


State College of Washington 


The present investigation was stimulated by 
the report of Price and Deabler (5) who, 
after testing for the spiral aftereffect on 
groups of normal, nonorganic psychiatric, and 
organic brain-damaged hospitalized patients, 
concluded: “When failure to perceive the 
aftereffect is found it is almost certain that 
cortical involvement exists” (5, p. 301). They 
state that the results of a previous study by 
Freeman and Josey (1) led them to believe 
that this phenomenon might be used to de- 
velop a technique for the detection of organic 
brain damage. In a subsequent study, Gallese 
concluded that “the test almost always indi- 
cates organicity when organic scores are ob- 
tained although the converse is not true” (2, 
p. 257). 

This study represents an effort to evaluate 
further the spiral aftereffect as a screening 
device for cortical brain damage with a VA 
GM&S hospital sample. In such a setting 
where large numbers of patients are examined 
for diagnosis and possible acute and intensive 
treatment, it would be highly desirable for 
the clinical psychologist to be able effectively 
to go beyond the mere assessment of the effi- 
ciency of functioning in those areas for which 
we presently have standardized techniques 
such as the Wechsler-Bellevue. There is little 
question that we are still far from able to 
state with any degree of confidence to what 
extent specific functional or organic variables 
contribute to the observed or predicted per- 
formance of the patients we examine. Yet it 
is apparent that our goal in the area of dif- 
ferential diagnosis must be to rely as little as 
possible on the art of clinical diagnosis which 
is strong in the tradition of neurology and 


psychiatry and to continually strive to de- 
velop rigorously standardized techniques from 
which statistically sound predictions of known 
validity can be generated. 

The results obtained by Price and Deabler 
were of more than usual interest because, con- 
sidering the complexity of the human organ- 
ism and the nature of the variables involved 
in the loss of function from cortical brain 
damage, it is hard to overcome initially strong 
scepticism toward the claim that a single, 
brief test of one modality could develop into 
an almost infallible sign of such damage. As a 
consequence, the study by Price and Deabler 
seemed to demand further evaluation. 


Method 


Every effort was made to follow the instruc- 
tions of Price and Deabler for administering 
the test. An Archimedes Spiral of 920°, or 
two and one-half turns about its center, was 
constructed by painting a black spiral on a 
white background. This wheel was affixed to 
a reversible motor with a variable speed con- 
trol which was set at 100 rpm for the experi- 
ment. The apparatus was placed in an inside 
room which was well illuminated by fluores- 
cent lighting. The spiral was placed at eye 
level and the subject was seated at a distance 
of eight feet from the apparatus. 

One experimenter was employed to test all 
of the patients. Four trials were given alter- 
nating clockwise and counter-clockwise rota- 
tions. Scoring was on the basis of 1 or O for 
each trial. Samples of responses to the after- 
effect scored 1 were: “It’s coming toward me.” 
“Getting larger.” “Expanding.” “Going in the 
opposite direction.” “Swelling.” “Shrinking.” 
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“Opposite.” Samples of 0 scoring were: “Noth- 
ing.” “Stopped.” “Standing there.” “Standing 
still.” “Same.” 

A check of scoring reliability by four judges 
on the first 35 cases showed no disagreements 
except for one instance in which a judge mis- 
read the handwriting on the recording form. 


Sample 


During a three-month experimental period 
all available patients admitted to the Psy- 
chiatry and Neurology Services respectively 
were tested. The total number of admissions 
to Psychiatry were 112 and to Neurology 175. 


Table 1 


Diagnoses and Scores on Archimedes Spiral Aftereffect 
of Psychiatric Sample 


(N = 87) 








Distribution of 
Scores 


Diagnosis 





Organic (NV = 7) 


Acute brain syndrome associated 
with alcohol intoxication 

Chronic brain syndrome 
psychosis 

Encephalitis, epidemic chronic 

Encephalopathy, chronic progres- 
sive 

Depressive reaction with hyper- 
tension and cerebral vascular 
accident 

Alzheimer’s disease 


with 


Nonpsychotic (VN = 57) 


Anxiety reaction 

Conversion reaction 
Depressive reaction 
Dissociative reaction 
Obsessive-compulsive reaction 
Psychophysiologic reaction 
Adult situational maladjustment 
Chronic alcoholism 

Dysocial reaction 

Emotional instability reaction 
Inadequate personality 
Mental deficiency 


_ 
eee WN SN Ue © 


Psychotic (V = 23) 


Involutional melancholia 
Paranoid schizophrenia 
Schizophrenic reaction other 
Manic depressive 
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To maintain standard testing conditions, an 
initial requirement was instituted that only 
patients who were able to come to the testing 
room would be included; the criterion being 
the judgment of the nurse in charf¥e of the 
ward. As a result of this condition, 16 pa- 
tients were omitted from the Psychiatry sam- 
ple. An inspection of the cases showed physi- 
cal illness, assaultiveness, and suicidal risk to 
be the bases of the nurses’ decisions. Seven 
patients were omitted through administrative 
error. Two patients who did come for testing 
were unable to attend to the task. Table 1 
shows the distribution by diagnosis and score 
of the remaining 87 cases which constitute 
the Psychiatry sample. 

Thirty-five patients admitted to Neurology 
were omitted from the sample of these cases. 
Fifteen were discharged before testing could 
be done, six expired, eleven were bedridden, 
three were aphasic and after reporting for the 
test did not have sufficient verbal ability to 
be tested under the standard conditions. 
Table 2 gives the distribution by diagnosis 
and score of the remaining 140 cases. 

Mean age of the Psychiatric sample is 41.2 
years and mean age of the Neurologic sample 
43.4 years. Both groups were tested an av- 
erage of five days after admission. Diagnoses 
in all cases were final diagnoses and were un- 
contaminated by test results which were with- 
held until all diagnoses had been obtained. 


Results and Discussion 


In reporting their findings, Price and Deab- 
ler used the terms “organic,” “brain dam- 
aged,” and “cortical” as though they were 
equivalent. That the first two terms may con- 
vey very similar meaning is understandable, 
but the term cortical has a highly specific 
connotation referring to a particular portion 
of the brain. Price and Deabler’s summary 
statement led us to conclude that, despite the 
ambiguous use of these words, their intention 
was to restrict the test to the detection of 
cortical brain damage. It seemed imperative, 
then, to dichotomize our organic brain syn- 
drome cases into cortical and noncortical sub- 
samples. Parenthetically, it may be noted 
that assigning only cortical cases to the cate- 
gory of expected positives was more favor- 
able to the hypotheses of Price and Deabler 
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Table 2 
Diagnoses and Scores on Archimedes Spiral Aftereffect 
of Neurologic Sample 
(N = 140) 


Distribution of 
Scores 


Diagnosis i 2 aie 


Organic brain syndrome* (NV = 
Traumatic brain injury 
Demyelinating disease of brain 
Brain tumor 
Cerebral vascular accident 
Convulsive disorder 
Chronic brain syndrome 
Meningitis 
Paralysis agitans 
Other 

Organic non-brain syndrome» 

(N = 56) 
Organic disease not found (N = 8) 
Psychoneurosis® N = 8) 


N we tO 
_ 
COwnre Uhre wu 


nw 
an > 


® Specific diagnoses iu each category are as follows: traumatic 


brain injuries—2 posttraimstic epilepsy ; 1 cerebral concussion, 
1 basilar skull fracture, 2 cerebral contusion, 1 penetration 
wound right parietal area, 1 postconcussion syndrome, 1 post- 
traumatic encephalopaihy; demyelinating diseases—3 amyo- 
trophic lateral sclerosis, 1 multiple sclerosis, 1 primary lateral 
sclerosis, 1 pyramidal! tract disease bilateral; brain tumors—1 
brain tumor, 1 retrobulbar, 1 right temporal, 1 right thalamic; 
cerebral vascular accidents—1 c.v.a., 1 embolus middle cerebral 
artery with convulsive disorder, 1 occlusion of right internal 
carotid, 4 cerebral thrombosis, 1 internal carotid artery, 1 post 
cerebellar artery, 1 old with motor aphasia and right hemi- 
paresis, 1 right hemisphere; convulsive disorders—1i black-out 
spells, 1 cetrencephalic seizures, 6 grand mal epilepsy, 1 grand 
mal and petit mal epilepsy, 1 grand mal with drug withdrawal, 
8 epilepsy of undetermined etiology, 2 psychomotor seizures; 
chronic brain syndrome—1 chronic alcoholism with secondary 
diffuse cortical atrophy, 1 degenerative brain disease, 2 unknown 
etiology diffuse ; meningitis—1 pneumococcal type III, 1 syphilis 
tertiar ther—2 metastatic carcinoma, 1 cervical mass, 1 skull 
defect right frontal, 1 subarachnoid hemorrhage, 1 tabes 
dorsalis, 1 headache with retinal hemorrhages, 1 hypertensive 
cardiovascular disease. 

> Specific diagnoses are as follows: 1 arachn-id cyst, 1 
arteriosclerotic heart disease, 1 back-pain undetermined eti- 
ology, 2 Bell's palsy, 1 cervical spine trauma, 1 degenerative 
cord disease, 1 epididymal cyst, 1 ganglion cyst right wrist, 1 
metastatic hypernephroma, 4 low back syndrome, 1 lumbar 
strain, 1 Meniere's disease, 1 myotonia dystrophica, 1 nephro- 
tosis, 1 posttraumatic neuropathy, 24 herniated nucleus 
pulposus, 1 occlusion anterior spinal artery, 3 peripheral 
neuritis, 3 radiculitis, 1 rheumatic heart disease, 1 sacroiliac 
strain, 1 sciatic syndrome, 1 possible slipped disc, 2 syncope of 
undetermined etiology. 

¢ Specific diagnoses are as follows: 1 anxiety reaction, 1 func- 
tional cephailgia, 4 conversion reaction, 1 depressive reaction, 1 
psychoneurosis severe with paranoid trends. 


than the inclusion of all brain-damage cases 
would have been. 

We relied on the best judgments available 
to classify our sample on the basis of diag- 
nosis. Nevertheless, we did so with reserva- 

1We wish to thank E. T. Bell, Emeritus Profes- 
sor of Pathology, University of Minnesota Medical 
School, and David Webster of Minneapolis VA Hos- 


pital neurology staff for their assistance in categoriz- 
ing the neurological cases. 
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tion, since a realistic consideration of the 
problem must lead to the conclusion that 
within the limits of a study of this type it is 
not possible to establish a very satisfactory 
criterion of cortical brain damage. Even with 
unlimited resources there still would be seri- 
ous weaknesses in the criterion because of the 
difficulties inherent in the accurate determi- 
nation of the localization and extent of corti- 
cal brain damage with even the best of cur- 
rent neurological techniques. 

Price and Deabler did not elaborate on the 
basis by which they conclude that their cases 
of CNS syphilis, cerebral vascular accident, 
chronic brain syndrome associated with alco- 
holism, cerebral arteriosclerosis, posttraumatic 
encephalopathy, and advanced Parkinson’s 
disease have known cortical involvement be- 
yond stating that the diagnoses were based 
on hospital staff action. Because of our lack 
of confidence in such diagnoses and for the 
reason noted above, we were conservative and 
assumed that only the 30 cases, which in- 
cluded traumatic brain injury, brain tumor, 
cerebral vascular accident, chronic brain syn- 
drome, and meningitis (listed under the or- 
ganic brain syndrome subgroup in Table 2), 
plus the seven cases from the organic sub- 
group (listed in Table 1) have cortical brain 
damage of a pathological degree. 

Meehl and Rosen’s (3) procedure for the 
evaluation of any assessment or prediction 
device was followed in the analysis of the 
test results. In the first three columns of 
Table 3, f denotes the number of diagnosed 
cases of cortical damage receiving each score, 
cf indicates the cumulative frequency and P, 
represents the proportion of the total group 
of those with cortical damage who scored at 
or below each scoring interval. The next three 
columns give the same data for the group 
without cortical damage. In order to conserve 
space we have eliminated the intermediate 
calculations given for illustrative purposes by 
Meehl and Rosen. Hr represents the propor- 
tion of correct classifications for the given 
cutting line; i.e., the number of valid posi- 
tives plus the number of valid negatives di- 
vided by the total number of cases. 

When all cases are considered, it will be 
noted that the test does no better than the 
base rates; i.e., if we simply called all cases 
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Table 3 


Cumulative Frequency Tables for Calculation of Optimal Cutting Scores 
for Archimedes Spiral Aftereffect 








Cortical 


Noncortical 





cf 





cf 





12 
15 
18 
24 


Combined sample* 


Psychiatry cases” 


8 
8 
il 
17 
30 


Neurology cases* 


13 
22 
29 
52 
190 





* Base rate for cortical brain damage in this sample is .16. 
> Base rate for cortical brain damage in this sample is .08. 
¢ Base rate for cortical brain damage in this sample is .21. 


free of cortical damage we would correctly 
identify 84% of the sample. Using the test, 
we correctly identify 83.4%, provided that we 
use a cutting score between 0 and 1. Using 
the cutting line between 2 and 3 suggested by 
Price and Deabler, we would classify the cases 
correctly only 78.9% of the time as compared 
to the base rates of 84%. 

In Psychiatry, where 8% of the sample 
were diagnosed as having cortical damage, if 
the cutting line were set between 1 and 2, 
93% of the cases would be correctly identi- 
fied as compared to 92%, using only the base 
rate and calling all cases non-brain-damaged. 
Maximum effectiveness is again attained with 
a cutting line between O and 1 where 94.3% 
of the cases are correctly identified. 

In the Neurologic sample, the test is un- 
able to improve upon the base rates. By rou- 
tinely calling all cases not cortically damaged, 
79% would be correctly classified. Using a 
cutting score between 0 and 1 the test would 
attain its maximum number of correct identi- 
fications, 77%. Use of the test, then, would 
result in a 2% decrement in predictive ac- 
curacy. 

In addition to the total number of correct 


classifications, evaluation of a clinical instru- 
ment is dependent upon the importance at- 
tributed to detecting the greatest number of 
cortically damaged cases and to the cost 
and the facilities available for further neuro- 
logical evaluation of mislabeled, noncortically 
damaged patients. If one wished to avoid miss- 
ing any cortical cases regardless of the cost, 
he would choose a cutting line between 3 
and 4 for a population resembling our com- 
bined Neurology and Psychiatry samples. 
This would yield the detection of 64.9% 
(P,) of the cortical cases at the expense of 
misclassifying 27% (P2) of the noncortical 
cases. If one were interested in avoiding ad- 
ditional neurological examinations because of 
staff limitations, he wouid select a cutting line 
between 0 and 1 where 32.4% of the cortical 
cases are identified, and only 6.8% of the 
noncortical cases are misdiagnosed by the test. 

Table 4 contains a similar analysis of the 
data presented by Price and Deabler (5), 
Gallese (2), and Page, Rakita, Kaplan, and 
Smith (4). We have also repeated the data 
for our combined Neurology and Psychiatry 
samples to simplify comparison of all four 
studies. Since none of the other three studies 
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report the base rates for their populations, we 
have used the empirical rate obtained from 
our combined sample (.16) as the estimate 
with which to make the necessary calculations. 

It will be noted in Table 4 that the maxi- 
mum total hit rate (H;) varies from 96.4% 
in the Price and Deabler study, to 90.4% for 
Gallese, to 81.0% for Page et al., to 83.4% 
for the Minneapolis sample, and that the cut- 
ting line at which the maximum is obtained 
alternates from a point between 2 and 3 in 
the first two studies to between 0 and 1 for 
the last two studies. Gallese stated that his 
method was more apt to reduce the propor- 
tion of false positives inasmuch as he con- 
tinued to press for a “normal” response if it 
were not at first obtained. He was also more 
liberal in scoring responses in the “normal” 
direction. In the Page et al. study, the spiral 
was revolved in one direction only for three 
trials rather than in two directions for four 
trials. 

The present study, however, does not differ 
from Price and Deabler’s in test administra- 


Table 4 


Percentage of Hits for Each Cutting Line in Different 
Hospital Samples Assuming a Base Rate of .16 


Percentage of Hits in 
Each Study 

Price & Page Minne- 

Deabler Gallese ef al. apolis 


Cutting Lines Between 





0 and 1 

100.0 
92.9 
93.6 


Hp (Positive hit rate) 
Hw (Negative hit rate 
Hr (Total hit rate) 


1 and 2 
Hp 
Hy 
Hr 


2 and 3 
Hp 
Hy 
Hr 

3 and 4 
Hp 
Hy 
Hr 





* Data unavailable. 
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tion. The reduction in total hit rate to 83.4% 
(using a cutting line between O and 1) is in 
our opinion largely a result of differences in 
sampling. Our sample was drawn randomly. 
All consecutive admissions who could be 
tested were tested, with exceptions as noted 
above. Furthermore, our patients were prob- 
ably less deteriorated than those selected by 
Price and Deabler though we cannot be cer- 
tain because of the paucity of information 
they provided. We feel that the lack of de- 
scription of their sample represents a very 
serious deficiency in the usefulness of their 
results. It is possible to select samples of 
brain-damaged and psychotic populations such 
that the brain-damaged, being severely de- 
teriorated, can fail almost any task given 
them and the psychotics, being relatively in- 
tact, obtain a high rate of success. Such a 
procedure yields a “mythical population,” 
ie., one which does not occur in practice. 
Tests validated on such groups will give de- 
ceptively good discrimination between groups, 
only to fail on cross-validation. 

It is perhaps well to note that developing a 
mythical population is not entirely without 
virtue. If the means of obtaining such a sam- 
ple are carefully reported, it may be possible 
to locate or duplicate a reasonably similar 
sample. On the other hand, nonrandomly se- 
lecting a sample may result in one being un- 
able to validly predict what the results might 
be even in one’s own institutional setting 
where the sample was drawn. We can only 
speculate, therefore, as to the nature of the 
differences between the other samples and the 
one drawn for this study. Gallese, for exam- 
ple, reports that the median duration of hos- 
pitalization for his organic group is 65 months 
and for his schizophrenic group 33 months in 
contrast to our groups who were tested an 
average of five days after admission to the 
hospital. There are also substantial differences 
in the percentage of cases of CNS syphilis in 
our samples and those of Gallese, and it is 
probable that this would be the case in a com- 
parison with the Price and Deabler sample. 
The fact that Gallese obtains results inter- 
mediate between ours and those of Price and 
Deabler might suggest that he has drawn 
a sample which more closely approximates 
theirs on such significant variables. 
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Table 4 also contains for each cutting line 
the positive hit rate 





H true positives 
P true positives + false positives 


and the negative hit rate 


(1. = 


Thus, in Price and Deabler’s study, 100% of 
cases obtaining a score of 0 are diagnosed as 
cortically damaged, while this is true for only 
43.2% of the Page et al. sample and 47.4% 
of our sample. Conversely, if we wished to 
maximize the negative hit rate, the cutting 
line should be placed between 3 and 4 where 
the rate of correct negative predictions was 
99.7% for the Price and Deabler sample and 
91.6% in our sample. One can, then, select 
a cutting line depending on (a) which sample 
most closely resembles the one in which the 
test is to be used and (4) whether one wishes 
to maximize Hp, Hy, or Hr. 

In most clinics, use of the spiral aftereffect 
as a screening device will be considered in re- 
lation to the efficiency of the instrument as 
well as the kinds of decisions which will be 
made on the basis of positive results. Assum- 
ing that further cross-validation will not cause 
much shrinkage, it might be practical to use 
the test with samples which resemble our 
Psychiatric sample where only 8% false posi- 
tives were obtained with a cutting score be- 
tween 0 and 1. Considering the small amount 
of time and effort involved, it would be fea- 
sible to administer the test to all patients ad- 
mitted to Psychiatry and to recommend a 
neurological workup for those who score be- 
low this cutting line. 

The test would not seem to have utility for 
the screening of Neurology patients. Since 





= true negatives ) 
true negatives + false negatives / © 
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there is such an unfavorable ratio between 
valid and false positives it would not appear 
feasible even to advise the neurologist to use 
the test findings as an added weight in his 
diagnostic formulation. 


Summary 


Evaluation of 87 consecutive testable ad- 
missions to the Psychiatry Service and 140 
to the Neurology Service of the Minneapolis 
Veterans Administration Hospital by means 
of the Archimedes Spiral Aftereffect indicated 
that the test had very limited clinical useful- 
ness for detecting pathological cortical brain 
damage. Use of the test did not significantly 
improve the diagnostic efficiency which would 
be obtained by using the base rates. 

Problems of sampling and criteria were dis- 
cussed. Differences between this study and 
those previously reported appear to be largely 
a function of differences in the samples 
studied. Problems inherent in the use of non- 
random samples for purposes of validation 
were also pointed out. 


Received August 21, 1957. 
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The spiral aftereffect may normally be seen 
when a circular disk supporting an Archi- 
medes spiral is rotated within certain limits 
of speed. During the period of rotation, the 
spiral appears either to expand or to contract, 
depending on both the direction of rotation 
and the direction of the coil of the spiral. 
When the rotating movement of the disk is 
interrupted, an aftereffect occurs so that the 
spiral appears to rotate in the opposite di- 
rection. 

Like many techniques being reactivated in 
current perception research, the spiral after- 
effect has a substantial history. In 1850, Pla- 
teau, the Belgian scientist, reported the first 
systematic study of the moving spiral. More 
lately, the relation between brightness, direc- 
tion, velocity, etc., and the perception of the 
aftereffect has been investigated (3, 9). 

Along with the interest in the various pa- 
rameters of the experience is the attempt to 
relate the performance on the spiral to dis- 
crete mental functions and to brain damage. 
Freeman and Josey’s paper (1) on the corre- 
spondence between memory impairment and 
reduced ability to perceive the aftereffect, 
spurred Price and Deabler (7) to measure the 
relationship between the aftereffect and corti- 
cal damage. The results of this latter effort 
are most impressive in that virtually no over- 
lap in performance was found between a 
group of brain-damaged patients and two con- 

1 Now at Crile VA Hospital, Medical Service. 

2 Now at VA Hospital, Denver. 

8 The authors wish to express their appreciation to 
Joen Fagan, graduate student at Pennsylvania State 
University, and Ted Rice, graduate student at West- 


ern Reserve University, who assisted in collecting and 
evaluating the data. 


trol groups. In the brain-damaged group 98% 
failed to perceive the aftereffect in each of 
the four consecutive trials, while 95% of the 
nonorganic psychiatric population, and 92.5% 
of the normal population, were able to per- 
form satisfactorily on each of the four trials. 
In a subsequent paper, Garrett, Price, and 
Deabler (2) repeated their original work with 
40 chronic brain-injured patients. Their find- 
ings substantiate their previous results, inas- 
much as 97.5% of the sample failed to per- 
ceive the aftereffect in each of the four trials. 
Page et al. (6) have provided another test re- 
lating performance on the spiral to cortical 
damage. While their results were in the same 
direction as Price and Deabler’s, the ability 
of the spiral test to discriminate was patently 
less efficient. In another study conducted by 
Smith and Goldberg,* the spiral was adminis- 
tered to normals, psychiatric patients, organ- 
ics, and postshock patients. The test proved 
capable of discriminating between pathologi- 
cal processes and normals, but apparently 
failed to distinguish organic impairment from 
the functional disorders. 

The present study was undertaken to 
cross-validate the findings reported by Price 
and Deabler and particularly the statement, 
“While adequate explanation of the phe- 
nomenon remains to be hypothesized and 
verified, the results are such as to justify 
present clinical use of this technique in diag- 
nosis of organicity (7, p. 302). In addition 
the following problems were evaluated: (a) 
the effect of the direction of the coil of the 
spiral on difficulty and duration of the after- 


*P. Smith and L. Goldberg. Personal communica- 
tion. April, 1957. 
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effect, (b) the relation between visual acuity 
and the aftereffect, and (c) the relation be- 
tween the aftereffect and various neurological 
indicia. 

Method 
Subjects 


With the exception of bedfast patients and 
those patients who were too incapacitated to 
cooperate, all patients hospitalized during a 
three-month period on a neurological ward in 
a VA hospital were selected for study. Some 
months after the experimental data were col- 
lected, the patients’ clinical charts were re- 
viewed for information relevant to the follow- 
ing areas: diagnoses,® admission symptoms, 
electroencephalographic (EEG) reports, pneu- 
moencephalographic studies, skull X-ray re- 
ports, findings on spinal fluid examination, 
and reports on visual fields. In certain cases, 
the chart failed to yield data for each vari- 
able. The availability of this type of infor- 
mation was largely a function of each pa- 
tient’s condition and the consequent need for 
laboratory study. 


Apparatus 


A six-inch Archimedes spiral formed in 24 
convolutions was drawn and photographed. 
Two identical spirals were developed, except 
that the second was the photographic rever- 
sal of the first. The two spirals were then 
mounted on two 6-in., circular, metal disks. 
A small metal collar was centered and fas- 
tened to the reverse side of the disks. The 
collar was threaded to complement a machine 
screw which had been fitted to the center post 
of a phonograph turntable. The turntable, to- 
gether with its driving mechanism, was sup- 
ported on an adjustable laboratory stand so 
that the target spiral could be presented at 
eye level to a subject (S) seated at a distance 
of 8 ft. The turntable was activated by a 
78 rpm unidirectional motor. Spiral A seemed 
to expand while rotating and normally would 
be said to contract in the motionless after- 
effect condition. Spiral B, since it was the 
photographic reversal, and since the motor 
was unidirectional, appeared to be contract- 
ing while rotating and to be expanding in the 
motionless aftereffect condition. 


5 All diagnoses reported in this study were estab- 
lished after weeks of observation and study. 
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In addition to the spiral apparatus, a Snel- 
len chart was used to assess visual acuity. As 
with most work on visual acuity, the minimal 
separable or visual angle was considered the 
invariant. However, since the influence of dis- 
tance is not clearly defined in present-day 
knowledge of visual acuity, it was also con- 
trolled. The size of the room used for testing 
was such that the Ss had to read the chart 
from a distance of 18 ft. By convention, read- 
ings of visual acuity, thus obtained, would be 
adjusted to comply with a 20-ft. distance 
from the chart. Since the distance error in 
the ratio was constant in this study, it was 
not necessary to make this adjustment. In all 
calculations based on Snellen chart readings, 
the denominator in the ratio was used exclu- 
sively (i.e., with readings such as 18/30 or 
18/20, the computations were performed only 
on the denominators). 


Testing Procedure 


Two experimenters (Es) were present dur- 
ing all testing. One E typically discussed the 
procedure with S$ and recorded behavior. The 
second E observed and adjusted the appa- 
ratus. The Ss were encouraged to wear glasses. 
When glasses were used, they were required 
throughout the testing. All visual tasks in this 
experimentation were viewed binocularly un- 
der artificial overhead light. 


The patient was told that the tests about to be ad- 
ministered were a special eye examination. The Snel- 
len chart was then called to the patient’s attention. 
He was told to read each succeeding line from left to 
right until interrupted. The score on this task was 
the denominator of the acuity ratio of the most diffi- 
cult line read faultlessly. The criterion required that 
the line be read perfectly twice and that the suc- 
ceeding line be faulted twice. The latter condition 
was obviously inapplicable in the case of an error- 
less performance on the entire chart. The Ss were 
prompted to rest their eyes whenever a line was 
failed. 

The spiral task was then introduced. The two 
spirals were presented twice each for every S in 
“controlled” random order for a total of four ex- 
posures per S. The order of presentation was thus 
randonized for blocks of six patients: ABAB; ABBA; 
AABB; BBAA; BABA; BAAB. Hereafter, each of 
the four exposures are referred to as a presentation. 
Between presentations, a cardboard shield was used 
to cover the apparatus and the S was instructed to 
close his eyes to rest them. Every effort was made 
to make the interval between presentations uniform 
both for time and auditory cues. This was necessary 
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in order to reduce the extraneous information which 
might be used to anticipate which spiral was being 
readied for the next presentation. 

The instructions for the spiral task were, “Now 
please look at the center of the design on the wheel 
and keep looking right at the center.” At that point, 
the motor which activated the spiral was switched 
on. After 10 seconds, the S was asked to describe 
what he saw. All Ss included in the experimental 
sample were able to describe some form of expand- 
ing or contracting experience while the spiral was 
rotating. The form described depended, of course, on 
which of the two spirals was being used during the 
initial presentation. After thirty seconds of observa- 
tion, the power was cut and the spiral came to rest. 
The average temporal lag between the time the power 
was cut and the time the spiral came to a complete 
stop was 24 seconds. During this interval, S was 
cautioned again to “Keep your eyes on the center 
of the design on the wheel.” After five seconds 
elapsed the S was asked a series of questions in the 
following sequence: (a) What do you see now? (b) 
Does it look different now? (c) Does it look like it’s 
moving now? (d) Does it look like it’s getting larger 
or smaller now? 

The procedure was in marked contrast to the one 
employed by Price and Deabler. They assert “there 
was little ambiguity as to the presence or absence of 
the aftereffect with most Ss” (7, p. 302). In the pre- 
testing of the apparatus used in the present study, 
there were many occasions when S’s verbal report 
failed to establish the legitimacy of the aftereffect 
experience. The graduated series of questions was de- 
signed to uncover the essence of S’s perceptual ex- 
perience. A response at any level of questioning which 
indicated that the S experienced a change either in 
direction, size, or in distance was acceptable as long 
as the experience was in the expected direction (i-., 
with a spiral which would normally be seen as con- 
tracting in the aftereffect condition, reports such as 
moving away, increasing distance, or decreasing size 
were all equally “correct’). All acceptable responses 
were scored plus. When S’s response for any pres- 
entation was “correct” at any one of the four levels 
of questioning, that presentation was completed. 
Thus, all four levels of questioning were used only 
for responses which were not sufficiently clear to be 
scored with confidence. A response which ultimately 
failed to convince the E that the S perceived the 
aftereffect or which, at any level of questioning, de- 
scribed the movement in the “wrong” direction, was 
scored as a minus performance for that presenta- 
tion. Allowedly, the verbiage employed in the ques- 
tions tended to promote ideation about movement, 
distance, and size. Nevertheless, a “correct” response 
had to incorporate direction of change. In spite of 
the nature of the questions, the direction of change 
had a chance expectancy of two correct guesses in 
four presentations. Accordingly, the S was classified 
as a “Seer” if he scored plus in at least three of the 
four presentations. Ss earning two or more minus 
scores were classified as “Nonseers.” 

In the case of “Nonseers,” the fourth presentation 
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completed their performance. The “Seers,” however, 
were asked to continue. The particular spiral used in 
the fourth presentation was retained through four 
additional trials, during which the duration of the 
aftereffect was ascertained. These four duration ex- 
posures will hereafter be referred to as trials. The 
time per trial was recorded in seconds from the in- 
stant the spiral was set into motion, through the 30- 
second exposure period, and up to the moment the 
S indicated that the aftereffect seemed to be extin- 
guished. The directions for these four trials were, 
“Do you remember just a moment ago when we 
stopped the wheel, you told me that the design on 
the wheel looked like it was ‘_—__________’_ (the 
subject’s own descriptive phrase was used). Now 
when we start the wheel this time, look at the center 
again and keep looking at it after I stop it. Then 
tell me when the ‘__________’ (inserting the 
same phrase again) first stops.” After completing four 
trials, the “Seers” were excused from the test situa- 
tion. 


Results 


The final sample tested numbered 110 pa- 
tients, all of whom were admitted to the hos- 
pital ostensibly for the diagnosis and treat- 
ment of neurological conditions. The mean age 
of the sample was 38.4 years, with an SD of 
10.99 years and a range of 21 to 64. The pri- 
mary diagnoses represented in the sample are 
presented in Table 1 in order to provide some 
notion of the character of the ward popula- 
tion at the time the testing was conducted. 
Of the 110 patients tested, 31 (28%) were 
classified by their performance as “Nonseers.” 
The incidence of “Nonseers” for the various 
diagnoses is also presented in Table 1. 


Comparisons between “Seers” and “Nonseers” 


The means of the ages for the “Seers’” and 
“Nonseers” were contrasted, and the lack of 
significance is summarized in Table 2. In or- 
der to compare the means of the Snellen chart 
performances of the “Seers” and “Nonseers,” 
a square root transformation was used since 
the distributions were rather severely skewed. 
The results of this comparison as presented in 
Table 2 were significant (< .01). From this 
finding it may be said that “Nonseers” have 
less acute vision than “Seers” and that this 
relationship does not appear to be related to 
age. 


Comparison of the Two Spirals 


The distributions of duration times for each 
of the two spirals also required normalizing. 
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Table 1 





Diagnoses and Incidence of “‘Nonseers” 





“Non- 


Diagnosis N seers” 


Diagnosis 





Multiple sclerosis 24 1 
Encephalopathy 15 
Epilepsy, grand mal 15 
Parkinsonism 7 
Neurotic reaction 7 
Epilepsy, 
Jacksonian 4 
psychomotor 4 
idiopathic 3 


3 
0 


0 





Note. 
nerve, cerebral embolism,* Friedrich's ataxia, poliomyelitis, 


There was also one each of the following diagnoses: Muscular dystrophy, migraine, toxic neuropathy,* papillitus optic 


Syncope 
Arteriosclerosis 
Intracranial aneurysm 
Cerebral thrombosis 
Alcoholism, chronic 
Cephalgia 
Amyotrophic lateral 
sclerosis 


pinal muscular atrophy, neuralgia, Huntington's chorea,* 


osteoarthritis, transverse myelitis, and schizophrenic reaction, paranoid type.* 


* Denotes a ‘‘Nonseer.”’ 


Again the square root transformation was em- 
ployed. The results of the comparison of the 
mean duration times obtained on the two 
spirals are presented in Table 3. It is to be 
remembered that the duration trials were 
administered only to “Seers.” Accordingly, 
Spiral B, which normally is seen as expand- 
ing in the motionless condition, is seen for a 
significantly longer time than is Spiral A 
(< .001). 

Next, the relative difficulty of the two 
spirals was examined. Spiral A (contracting 
aftereffect) yielded 60 failures (27%) out of 
220 exposures. Spiral B yielded 36 failures 
(16%) in a similar number of exposures. 
These percentages were tested by the critical 
ratio formula and proved quite significant 
(< .01). Thus Spiral B (expanding after- 
effect) was appreciably less difficult than 
Spiral A. The performance on the two spirals 
was then distinguishable both for duration 
and difficulty of the aftereffect. The after- 


effect on Spiral B was easier to see and was 
seen for a longer time interval. 


Relation Between Spiral and Clinical Criteria 


The information gathered from the patients’ 
clinical charts was related to the performance 
on the spiral aftereffect. This approach in- 
volved analyzing the chart data on five inde- 
pendent measures, as well as via a global im- 
pression of brain damage. The five indices 
used were EEGs, pneumoencephalograms, 
skull X rays, spinal fluid, and visual fields. 
The global judgments integrated these five 
indices, as well as the admission symptoms 
and the diagnoses into a single, aggregate 
judgment. 

Two judges independently evaluated the 
abstracted chart summaries for each of the 
five indices with 20 of the patients. These 20 
charts were selected in a random manner with 
the one stipulation that information on the 
five indices was available. The judges were in- 


Table 2 


Age and Visual Acuity in ‘‘Seers” and ‘“‘Nonseers” 











Mean scores 





“Non- 
” 


Variable “Seers” seers 


SD 





“Non- 
“Seers” seers” 





37.71 
4.4871 


40.26 
5.9007 


Age 
Visual acuity* 


10.41 
1.3021 


12.35 
2.4707 





* Normalized via a square root transformation. 
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Table 3 


Mean SD i p 





152 6.3861 


i 6001 
B 163> 6.8879 


8018 


6.2969 <.001 


* Normalized via a square root transformation. 
b One patient failed to perceive the aftereffect on one of the 
duration trials on Spiral B. 


structed to assign each case to one of the 
three categories: obvious brain damage, ques- 
tionable brain damage, or no evidence of brain 
damage for each variable (EEG, etc.). The 
two judges agreed perfectly on 84% of the 
ratings. With this degree of consistency, it 
was possible to use the judgments of a single 
judge for the purpose of relating the five in- 
dices to the spiral performance. Whenever 
data were not available, the patient was ex- 
cluded from the analysis on that particular 
index. The contingency coefficients between 
the spiral performance and the neurological 
measures, as well as the levels of significance, 
are presented in Table 4. Only chance asso- 
ciation was demonstrated between the per- 
formance on the spiral test and the three 
clinical variables: EEGs, pneumoencephalo- 
grams, and skull X rays. However, in the case 
of the visual field examinations and the spiral 
fluid, significant correlations (both < .05) 
were obtained with the performance in the 
aftereffect situation. The global estimate of 
cortical damage was made by one of the au- 
thors.* The contingency coefficients along with 
the level of significance (< .05) are also pre- 
sented in Table 4. 


Discussion 

When certain stimulus patterns are ob- 
served in motion at critical speeds, the ac- 
tivity may appear to continue for some ap- 
preciable time after the objective motion 
ceases. Such experience is often termed mo- 
tion afterimage. The spiral aftereffect is a 
form of subjective motion afterimage, com- 
plicated by a negative experience of changing 
size or depth. As such, it incorporates radial 
movement, apparent movement toward or 
away from an observer, and gamma move- 


These judgments were made by the neurologist 
cooperating on this project. 
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ment, another form of apparent movement 
usually associated with a swelling of the size 
of a suddenly illuminated stimulus. 


The Spiral Aftereffect as a Clinical Tool 


The results of the present study tend to 
confirm the general usefulness of the spiral 
aftereffect as a heuristic laboratory technique 
for studying brain pathology. There is, how- 
ever, some real question about the propriety 
of the technique in clinical practice. The 
spiral task as explored in the present study 
does not appear to live up to the claims made 
for it by Price and Deabler as an individual 
diagnostic measure. 

The critical need in clinical practice is for 
a technique sufficiently valid to be utilized 
with ambiguous and difficult problems in the 
diagnosis of brain damage. When perform- 
ance on the spiral was compared with pa- 
tients’ diagnosis in the present study, the 
efficacy of the aftereffect as an individual 
diagnostic instrument seemed dubious. The 
disparities between the results in the studies 
by Price and Deabler and the present work 
may be attributable to many factors. For one 
thing, the degree of obviousness of brain dys- 
function in the organic samples tested may 
have been at variance. In addition, some very 
real differences in methodology were em- 
ployed. The scoring scheme in the present 
study made no distinction between verbal re- 
ports of changes in size and depth. In the 
study by Price and Deabler, size changes 
were given greater recognition in scoring. The 
justification for each specialization in scoring 
was not apparent to the present authors and, 
consequently, any response which indicated 


Table 4 


Relationship Between Performance on the Spiral Task 
and Clinical Neurological Tests 





Contingency 
coefficient 


Signifi- 


Variable cance 





EEG Al 
Pneumoencephalogram 03 
Skull X ray 05 
Spinal fluid 24 
Visual fields 27 
Global analysis 25 
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an appreciation of negative perceptual experi- 
ence was fully accredited. In addition, the in- 
tention in the present study was to provide 
S with every chance to report an experience 
which may normally appear somewhat freak- 
ish and unnatural. Such an experience, it was 
reasoned, might be rather difficult to verbal- 
ize. Frequently it was found that Ss did quite 
well on the task only after they fully under- 
stood the inquiry. At times this required the 
complete battery of four questions in order to 
elicit a satisfactory report. It is suggested that 
some of the inability of the organically im- 
paired to report the aftereffect in Price and 
Deabler’s paper may have been in part a con- 
sequence of communication failure rather than 
perception failure. 

The visual acuity findings are further rea- 
son for doubting the legitimacy of the spiral 
aftereffect as a clinical measure of brain dam- 
age. The need to consider visual acuity in a 
study of perceptual processes is discussed by 
Yates (10) who admonishes against the omis- 
sion of a measure of visual acuity in the de- 
velopment of perceptual tests of brain dam- 
age. This is an especially pertinent recom- 
mendation in instances where the experimental 
sample includes disease processes such as 
multiple sclerosis, wherein repeated atrophy 
of the optic nerve may frequently occur (4, 
8). When atrophy develops, it seems to be 
present most often in the area of the maculo- 
papillar bundle, which is involved in central 
vision. Since the spiral task involves foveal 
vision, in a large measure, the need for an 
index of visual acuity is apparent. 

In a superficial attempt to explore further 
the relationship between visual acuity and 
brain damage, the Snellen chart readings were 
contrasted in the three groups differentiated 
in the global neurological appraisal. No rela- 
tionship was found between these estimates of 
brain damage and visual acuity. The question 
as to the nature of the relationship between 
visual acuity and performance on the spiral 
must come under closer scrutiny before the 
technique can be used with confidence in 
clinical practice. 


The Comparison of the Two Spirals 


The performance on the two spirals indi- 
cates that the level of difficulty in perceiving 
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the aftereffect, as well as the length of the 
aftereffect, was quite distinct. One of the best 
documented facts about the perseveration of 
movement afterimage is its dependence upon 
the velocity of the revolving stimulus. How- 
ever, this factor was controlled in the present 
study, except for changes in line voltage which 
should have affected both spirals without bias. 
The finding that a spiral which seems to be 
expanding has a longer period of after-activity 
is consistent with observations of duration in 
gamma movement studies. In the latter stud- 
ies, the expansion phenomenon, which occurs 
when a light source is introduced in a visual 
field, has been contrasted with the shrinkage 
which is reported when the light is dimmed or 
extinguished. As in the observations on the 
two spirals, the spreading movement occupies 
a significantly longer time interval than the 
shrinking movement. The two spirals were 
also compared to determine if they were 
equally effective in differentiating cortical 
damage as determined by the global clinical 
judgments. In spite of the fact that Spiral A 
elicited a greater absolute number of failures, 
there was no difference in efficiency of the 
two spirals in detecting brain damage. 


The Spiral Aftereffect and Vision 


The degree of relationship between the 
spiral performance and clinical visual tests 
inspired conjecture about the possible connec- 
tion between deficit on the spiral task and in- 
jury to areas of the optic apparatus. The fact 
that the task was mediated through visual 
structures added to the temptation to local- 
ize the brain damage. Accordingly, all the pa- 
tients whose clinical charts suggested a global 
evaluation of obvious or questionable brain 
damage were re-evaluated in an effort to esti- 
mate the localization of the presumed dam- 
age. However, the level of reliability for these 
judgments proved unsatisfactory, and the ef- 
fort was abandoned for the present paper. 

The evidence in this study suggested rather 
pointedly that two somewhat antagonistic as- 
pects of visual functioning were simultane- 
ously impaired in “Nonseers.” On the one 
hand the ability of the “Nonseers” to make 
fine visual discriminations (visual acuity) 
was notably lessened. Such a function, ac- 
cording to various theories of vision, should 
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require a high degree of insulation among 
juxtaposed individual pathways as well as a 
great number and superior differentiation of 
retinal receptors. The af‘ereffect task, on the 
other hand, would require either centrally or 
in the peripheral system rather complex neural 
interaction. Thus the dual failure of “Non- 
seers’ on visual acuity and spiral aftereffect 
tasks provides concurrent behavioral evidence 
of deficit in anatomical insulation and physio- 
logical interaction. 


Conclusions 


The purpose of this study was to evaluate 
the use of the spiral aftereffect as an index 
of brain damage in a neurological setting. In 
addition, certain neurological specifications of 
organic impairment were employed as criteria 
against which perceptual experience was meas- 
ured. And, finally, the direction of the coil of 
the spiral was varied, and the visual acuity 
of the subjects was tested in order to evaluate 
their effect upon performance. The results are 
as follows: 


1. Of a sample of 110 patients admitted to 
a neurological ward, 28% were unable to per- 
form satisfactorily on the spiral task. 

2. Those who performed satisfactorily on 
the spiral task had significantly better visual 
acuity as measured by Snellen chart perform- 
ance, but were not distinguishable in age from 
those whose performance on the spiral task 
was considered unsatisfactory. 

3. Whether the aftereffect took the form of 
expansion or contraction was important for 
the ease with which it was reported, as well 
as for the length of time it was subjectively 
perceived. 

4. Performance on the spiral was related to 
spinal fluid examinations, visual field studies, 
and global judgments of the likelihood of 
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brain damage. Performance on the spiral was 
not related to EEG reports, pneumoencepha- 
lographic studies, or skull X-ray reports. 
The degree of relationship demonstrated be- 
tween estimated brain damage and spiral per- 
formance was taken as vindicating further ex- 
ploration of the technique. It was suggested, 
however, that caution be exercised in the clini- 
cal use of the test, particularly in those situa- 
tions in which the demand for such a test 
would be the most imperative, i.e., in the 
evaluation of individual cases where the diag- 
nosis is less than straight-forward and direct. 
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The perception of visual apparent move- 
ment in normal subjects has been widely 
studied. It is known that when two lights are 
exposed alternately at a low rate of alterna- 
tion, the perceptual emergent will be that of 
two lights flickering alternately. If the rate of 
alternation is raised, the subject will report a 
single light which swings smoothly back and 
forth. This state is known as optimal move- 
ment or beta movement (5). If the frequency 
of alternation is again raised, the subject will 
report seeing two lights which flicker simul- 
taneously. It has also been noted that at the 
boundary between the disappearance of beta 
motion and the perception of simultaneous 
flicker, a formless dark object in motion may 
be perceived, called omega movement (4). 

Research (1, 2, 3, 7) indicated that one of 
the most common results of organic damage 
to the cortex is the loss or impairment of the 
ability to perceive visual apparent movement. 
Saucer and Deabler (5), in an experiment de- 
signed to explore the use of beta movement 
as a diagnostic procedure in this area, used 
the appearance of omega movement as an 
index of failure to perceive optimal move- 
ment. They found that normal or control sub- 
jects were able to perceive beta movement at 
a much higher rate than organics. 

They also found that a group of undiffer- 
entiated schizophrenics showed some degree 
of impairment in the ability to perceive opti- 
mal movement, although a group of loboto- 
mized schizophrenics showed no measurable 
deficit. The present paper is a report of an 
experiment designed to further explore the 
loss of ability to perceive optimal movement 
in schizophrenic individuals. 


The apparatus used in this study was a low fre- 
quency, square-wave generator, with a Hayden mo- 
tor sweeping from 2.5 to 10 cycles per second in 


three minutes. The square-wave generator gated two 
Sylvania R1130B glow modulator tubes 180° out of 
phase. A green “Dialite” fresnel lens (No. 34-841) 
was placed over the end of the R1130B tube. The 
fresnel lenses were 2.5 inches center to center and 8 
inches behind a #-inch opal Lucite screen 12 X 12 
inches square. The intensity of the lights was ad- 
justed to optimal movement for all subjects at 2.5 
cycles per second and a mean breakdown rate of 
beta movement for a pilot group of controls of about 
5 cycles per second. 


Subjects and Method 


Subjects (Ss) were 21 undifferentiated 
schizophrenics, 21 paranoid schizophrenics, 
and 10 control Ss. The patients were drawn 
from the NP population of Perry Point Vet- 
erans Administration Hospital, while the con- 
trol group was composed of hospital nursing 
assistants and other nonprofessional hospital 
personnel. Patients were selected by ward 
psychiatrists and their diagnosis made known 
after they were tested. 


The Ss were brought one at a time into the test- 
ing room and asked if they would consent to sitting 
in a dark room for a few minutes. Those who were 
mute or incooperative were released immediately. The 
remaining Ss were given three minutes of dark adap- 
tation, during which time the experimenter engaged 
them in trivial conversation. 

At the end of the three minutes, the stimulus lights 
were turned on at an alternation frequency of 2.5 
cycles per second and the S asked to tell the experi- 
menter what he saw. Almost invariably the S re- 
ported seeing a single light moving back and forth 
across the screen. The sweep drive was then turned 
on and the S asked to report any changes in the 
light. Most Ss immediately reported “it’s getting 
faster,” but a simple “yes” seemed to be an ade- 
quate explanation. 

When the S reported a dark object in motion, the 
sweep drive was stopped and the frequency in cycles 
per second read from a calibrated dial. If the S did 
not report a single light moving back and forth 
which was replaced by a dark object in motion, no 
score was given. 
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Results 


The rate at which omega movement ap- 
peared was taken as the rate at which optimal 
movement had ceased. The mean rates and 
their associated standard deviations are given 
in Table 1. 

The results are presented in Table 2 in 
terms of ¢ and the Mann-Whitney U sta- 
tistic (6, pp. 116-127). Although the differ- 
ences between the schizophrenic group and 
the paranoid and control groups are statisti- 
cally significant on the basis of the computed 
t, nonparametric analysis was also carried 
out because (a) it is evident that the variance 
is correlated with the mean, and (0d) nothing 
is known concerning the population distribu- 
tion of omega thresholds. The Mann-Whitney 
x/sigma values are equal to critical ratios 
when N is sufficiently large and hence may 
be interpreted by reference to a table of 


Table 1 


Threshold Means and Standard Deviations of Groups 


Mean SD 
3.428 
5.15 

5.417 


Group 





766 
1.39 
1.96 


Schizophrenic 
Control 
Paranoid 


x/sigma for a normal distribution. It can be 
seen that the values are sufficiently large to 
allow the null hypothesis of no true difference 
between the schizophrenic group and the con- 
trol and paranoid groups to be rejected at a 
comfortable level of significance. 

Since the statistical analysis does not clearly 
show the degree of overlapping between the 
different categories, an analysis by approxi- 
mate quartiles is given in Table 3. It can be 
seen from this table that whereas the ma- 
jority of the schizophrenics are below the cut- 
ting score of 4.130 cycles per second, the 
larger part of the paranoid Ss tested fall into 
the higher quartile ranges. If we wager that 
all scores below the median belong to schizo- 
phrenics and all high scores to paranoids, then 
we shall be correct in 33 cases out of 42. If 
we make the same wager for the normal- 
schizophrenic grouping, we shall be correct in 
25 cases out of 31. The associated percent- 
ages of correct prediction are 78.5% for the 


Table 2 


Significance of Mean and Rank Differences 


Signifi- 
cance 


Signifi- 


Groups compared cance 


x/sigma 


Schizophrenic- 

Control 3.67 001 3.80 
Paranoid-Control NS NS 
Schizophrenic- 


Paranoid 4.35 001 


schizophrenic-paranoid group and 80% for 
the schizophrenic-normal group where 50% 
correct predictions can be made by change 
alone. 
Discussion 

The primary hypothesis of a difference in 
apparent motion perception between normal 
and undifferentiated schizophrenics seems to 
be tenable. The fact that these differences are 
more marked in this experiment than in the 
earlier experiment (5) may be due to the in- 
crease in sample size. The combined findings 
from the two experiments suggest that there 
is a noticeable deficit in the perception of 
optimal movement by the schizophrenic. 

Since the results of the experimental pro- 
cedure do agree reasonably well with the pres- 
ence or absence of schizophrenia as estab- 
lished by other diagnostic criteria, the use of 
apparent motion perception as an added diag- 
nostic procedure seems warranted. At the 
same time, due to the use of relatively com- 
plex electronic test equipment, the method 
seems more useful in connection with research 
procedures than as a routine diagnostic test. 

The evidence concerning the “normal” per- 
ception of apparent movement in the paranoid 
schizophrenic is of considerable interest and 


Table 3 


Distribution by Cutting Scores 








Schizo- 
phrenics 


Para 


Cutting score noids Controls 





2.50-3.375 10 
3.384.125 . 
4.130-4.875 3 
4.88-12.00 0 
Total 21 
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merits separate discussion. The differences be- 
tween schizophrenics and paranoid schizo- 
phrenics are so great as to suggest the pres- 
ence of an entirely different pathology. It has 
been argued earlier (5) that the ability to 
perceive apparent movement is an index of 
cortical efficiency. If this is so, then the para- 
noid schizophrenic shows little evidence of 
lowered cortical efficiency in spite of the fact 
that as a group the paranoid schizophrenics 
tested showed considerable behavioral pa- 
thology. 

In a first approximation to a theory of 
paranoid schizophrenia, it may be assumed 
that there are certain individuals who are 
genetically equipped with nervous systems 
which are highly efficient in the organization 
of perceptual processes. This assumption is 
not entirely groundless, since it is also im- 
plied in a genetic theory of intelligence. 

Such an individual might be expected to go 
beyond the norm in organizing his perceptions 
into a coherent system. If there are severe per- 
sonality problems present, this structure may 
become contaminated by personal reference or 
bias. Since the bias introduces some distor- 
tion of reality, it may be expected to change 
the individual’s behavior towards the external 
world. This in turn begins to produce reac- 
tions on the part of society which must be 
incorporated into the meaningful if paranoid 
structure of perceptual organizations. The en- 
tire process may become circular in nature 
with a high degree of positive feedback. The 
accentuated emotional loading on the cortical 
processes will in time produce a schizoid be- 
havior pattern. 

Thus, the first stages of the pathology would 
be characterized by hyper-alert attitudes and 
complex perceptual organizations which would 
later be replaced by increasing loss of contact 
with reality and finally by random pathologi- 
cal behavior of a generalized hebephrenic na- 
ture. The presence of schizophrenic behavioral 
components in the paranoid individual may 
be viewed as a consequence of genetic acci- 
dent plus the personal and private emotional 
problems acting together in a reactive rather 
than in a process fashion to produce such be- 
havior. 

It is likely that a long range study of para- 
noid schizophrenics would show that in the 
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early stages of development they would show 
hypernormal ability to organize perceptual 
processes. In spite of behavioral pathology, 
they would not show “schizophrenic” deficit 
in this area. During the later course of the 
pathology, a decline in organizational ability 
would be accompanied by an increase in 
schizoid symptoms until finally the two pa- 
thologies could not be separated on a per- 
ceptual basis alone. 


Summary 

Earlier evidence has indicated that schizo- 
phrenics may show impaired ability to per- 
ceive visual apparent movement. This paper 
is a report of an experiment designed to ex- 
plore the degree of impairment in a large sam- 
ple of schizophrenics. 

There are statistically significant differ- 
ences between the undifferentiated schizo- 
phrenic group and the control group but not 
between the control group and the paranoid 
schizophrenic group. These findings support 
the theory of schizophrenic deficit outlined 
earlier by Saucer and Deabler (5) but require 
a new interpretation of the processes of para- 
noid schizophrenia. A tentative hypothesis 
concerning the nature of paranoid processes 
is discussed. 

It is concluded that apparent visual move- 
ment thresholds may be a useful diagnostic 
tool but may be better suited to research than 
to routine diagnostic testing. 


Received August 9, 1957. 
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Success and Failure in Client-Centered Therapy As a 
Function of Client Personality Variables’ 


William L. Kirtner and Desmond S. Cartwright 


University of Chicago 


Cartwright (5) has presented evidence for 
the existence of a “failure-zone,” ranging be- 
tween 13 and 21 interviews, in client-cen- 
tered therapy. In a subsequent paper, Taylor 
(25) presented closely comparable evidence 
with respect to psychoanalytically oriented 
psychotherapy. 

Cartwright demonstrated that therapeutic 
outcomes were divided by the failure-zone 
and suggested that there may be two forms 
of therapeutic process, identifying these proc- 
esses as “short” (1-12 interviews) and “long” 
(13-77 or more interviews). He further hy- 
pothesized (5, p. 362) “that certain indi- 
vidual differences between clients give rise to 
different kinds of therapeutic process.” 

On the basis of these results and hypothe- 
ses, five client groups were distinguished: 
short success, 1-12 interviews; short failure, 
1-12 interviews; failure-zone, 13-21 inter- 
views; long success, more than 21 interviews; 
and long failure, more than 21 interviews. 
Utilizing this schema, Kirtner (12) hypothe- 
sized that therapy length-by-outcome is re- 
lated to the personality structures of clients 
at the beginning of therapy. The present pa- 
per reports an investigation of this hypothesis. 


Method and Procedure 
Subjects 


From the research files of the Counseling 
Center, University of Chicago, all clients were 
selected for whom the following four condi- 


1 This investigation is based upon a dissertation 
submitted by the senior author to the University of 
Chicago in partial fulfillment of the requirements for 
the degree of master of arts. It was supported in 
part by research grants from the National Institute 
of Mental Health, National Institutes of Health, 
U. S. Public Health Service, and from the Ford 
Foundation. 
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tions held: (a) the therapists’ rating of out- 
come on a 9-point success rating scale (22) 
fell either between 7 and 9 inclusive (“suc- 
cess”) or between 1 and 4 inclusive (“fail- 
ure’); (5) the exact number of interviews 
was recorded; (c) a pretherapy Thematic 
Apperception Test (TAT) protocol was avail- 
able; and (d) a recorded or transcribed first 
therapy interview was available. Twenty-four 
clients fulfilled all conditions. Two clients who 
fulfilled conditions (a), (6), and (c), but 
not (d), were also included, yielding n = 26 
for this study. All clients were seen by ex- 
perienced therapists (25 male and 1 female) 
between 1949 and 1954. The sample included 
12 males, 14 females; 10 students, 16 non- 
students; the mean age was 27.5 years and 
the range 19 to 41 years. 


Instruments 


Rating scales were developed to measure 
twelve personality variables. These variables 
were selected to comprehend: (a) some of 
those variables widely referred to as desig- 
nating level or pattern of personality integra- 
tion, e.g., sex role and identification, motiva- 
tion, impulse life; (4) some of those variables 
particularly relevant to the phenomenological 
basis of client-centered therapy, e.g., decision- 
making, rule usage, sense of self as a causa- 
tive agent (16; 17, pp. 4-5); and (c) some 
of those variables especially associated with 
previous research in client-centered therapy, 
e.g., sensitivity to others (23, 24, 26), intra- 
punitive feelings (9). Upon analysis of the re- 
sults derived from the application of the 12 
scales, 6 scales were eliminated due to func- 
tional overlap with the remainder. Numerical 
ratings made by an independent judge on the 
12 scales were intercorrelated. The resulting 


59 





260 


matrix was inspected for clusters from each 
of which one scale was selected, yielding four 
relatively independent scales, numbered I, 
IV, X, and XI. Scale VIII, while substan- 
tially correlated with Scale I, was uncorre- 
lated with the other three selected scales and 
is therefore reported upon. Scale XII, highly 
correlated with Scale XI on numerical rat- 
ings, is also reported upon because of its 
distinctive qualitative differentiation between 
clients. Over the six scales, the range of inter- 
correlations was — .24 to .77. The median 
correlation was .29. 

Following are brief descriptions of the six 
scales reported in this study. In complete 
form, all are 5-point, fully anchored scales.’ 
A rating of 1 indicates minimal disturbance 
(optimal adjustment), and a rating of 5 in- 
dicates maximal disturbance. 


Scale I—Impulse Life 
From free flow and expression of generative im- 
pulses and ideas, to extreme: 
a. fright about impulses but with expression; 
b. control or repression; 
c. distortion of impulses. 


Scale ITV—Sense of Capability 

From a sense of felt capacity to cope adequately 
with situations, to a sense of helplessness and lack of 
internal resources to cope with situations. 


Scale VITI—Sex-role and Identification 

From direct and open expression of sexuality with 
good control of feelings, to extreme conflict, uncer- 
tainty, and confusion about sex. 


Scale X—People—Object Orientation 
From the ability to relate deeply both to people 
and in activities, to: 
a. a desperate need for people; 
b. avoidance of relating to people with con- 
comitant desperate need for activities; 
c. immobilization. 


Scale XI—Sense of Comfort and Satisfaction 
From a sense of self-comfort and satisfaction with 
the ability to locate problem sources both internally 
and externally in good balance, to: 
a. intensely driven to search himself for causes 
of discomfort and dissatisfaction; 
b. intensely driven to search outward for causes 
of discomfort and dissatisfaction ; 
c. intense hopelessness to locate causes. 


2 Copies of the complete scales and of the raw 
data from both judges may be obtained without 
charge from William L. Kirtner, Counseling Center, 
University of Chicago, 5737 Drexel Avenue, Chicago 
37, Illinois, or for a fee from the American Docu- 
mentation Institute. Order Document No. 5585, re- 
mitting $1.75 for microfilm or $2.50 for photocopies. 
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Scale XII—Punitive Feelings 
From a self-appraising person who is not self- 
reproachful, to: 


a. extremely self-punishing person; 

6. extreme anticipation of punishment from ex- 
ternal source; 

c. vacillation between “a” and “b.” 


It will be noted that some of the scales in- 
clude alternate descriptive anchors, indicated 
by letters a, b, c. Such letter ratings occur 
only in connection with numerical ratings of 
3 or more. As the level of disturbance in- 
creases in a given area (as the numerical rat- 
ing is higher on these scales), the behavior 
displayed may take on one of several modal 
characteristics. Salient nonreactive, explosive, 
or other features of behavior become usual 
and representatively distinguishing attributes 
of the person rated. Such salient character- 
istics or modal patterns of behavior must be 
discretely described in order to obtain a fully 
anchored scale of levels of disturbance. As the 
level of disturbance decreases (as the nu- 
merical rating is lower on these scales), the 
behaviors displayed do not require multiple 
discrete descriptions; salient nonreactive, ex- 
plosive, or other features of behavior are 
neither usual nor representatively distinguish- 
ing attributes of the person rated. 

While the foregoing paragraph presents the 
descriptive (i.e., scale-making) problem, the 
existence of the problem invites explanation. 
The authors hold the working hypothesis that 
the individual adopts certain specifiable modes 
or techniques of behavior in order to control 
his internal disturbance or anxiety with re- 
spect to a given segment of his experience. 
As anxiety or disturbance intensifies, the tech- 
niques required for control of the disturb- 
ance, or for maintenance of personality or- 
ganization, become more drastic and more 
representatively typical of the individual. 


Ratings 


Ratings of each client, on each of the six 
scales, were made on the basis of the pre- 
therapy TAT protocol in conjunction with 
the recorded or transcribed first therapy in- 
terview. In two cases, the recorded first in- 
terview was not available, and the TAT pro- 
tocol alone was used. 

Ratings were made independently by the 
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senior author and by another judge. The lat- 
ter (“Judge II’) knew neither the length nor 
the success rating of any case. 


Reliability 


Reliability of the therapist rating of out- 
come on the 9-point scale has been previously 
reported (5, 22) as slightly better than r = 
80. To determine reliability for the numeri- 
cal ratings on the six scales, all ratings by 
the two independent judges were correlated 
using Kendall’s tau for ties in both rank- 
ings (10). The computational procedure em- 
ployed was that developed by Cartwright 
(3). Corrections for continuity were made 
using Schaeffer and Levitt’s (21) modifica- 
tion of Whitfield’s correction (27). The rou- 
tine for computing corrections for continuity 
and /p values was that described by Cart- 
wright (4). Table 1 presents these reliabil- 
ity data, which were considered adequate 
throughout. 

Letter ratings occurred on four out of the 
six scales. Since such ratings are associated 
only with numerical ratings of 3 or more on 
these scales, a long and tedious problem in 
combinatorics would have to be solved to ob- 
tain an exact estimate of reliability, which 
did not seem justifiable. A fortiori, it did not 
seem justifiable to work on a solution for the 
reliability of joint numerical-plus-letter rat- 
ings. Instead, simple percentages of agree- 
ment were computed for all cases where both 
judges employed a letter rating, and regard- 
less of numerical rating. Table 2 presents 
these data and includes, for the reader's in- 
formation, the number of cases where neither 
judge used a letter rating. 

No statistical analysis of results from let- 


Table 1 


Reliability Coefficients (Tau) for Numerical 
Ratings on Six Scales 


Scale 





. Impulse life 

/. Sense of capability 
Sex-role and identification 

. People-object orientation 

. Sense of comfort and satisfaction 
Punitive feelings 





Table 2 


Percentages of Agreement for Letter Ratings on 


Four Scales 


Number of cases for 


which letter ratings 


XI 
XII 


ter ratings was made. However, the authors 
judge the data of Table 2 to warrant descrip- 
tive characterization of modal behaviors dis- 
played by subjects rated on the four scales 
referred to. 


Analysis 


The sample of 26 cases was divided into 
length-by-outcome groups according to the 
schema given. Table 3 summarizes descrip- 
tions of the resulting groups. 

Regrettably for the research, only one long- 
failure case was among those available. This 
group was therefore eliminated from subse- 
quent analysis. The remaining groups were 
examined in all possible pairs on each of the 
six scales as numerically rated by Judge II. 
All comparisons were made using tau for a 
dichotomy in one ranking as described by 
Kendall (10, pp. 32-34). The significance of 
differences between groups was computed in 
the manner previously indicated for the reli- 
ability analysis. In order to show the direc- 
tion of differences between pairs of groups, 
mean values for each group on each scale 
were also computed. 


Results 


Numerical Ratings 


Table 4 presents means for groups on each 
of the six scales and the p values of the dif- 
ferences between groups.* 

Rejecting the null hypothesis if » < .05, 
each success group shows a lesser degree of 


8 See Footnote 2. 
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Table 3 


Summary Description of Groups 





Number of 
interviews 


Range of out- 


Group come ratings 





Short failures 
Short success 
Failure-zone 
Long failure 
Long success 


<13 
<13 
13-21 
>21 
>21 








Non- 


Male Female Student student 





6 


1 
6 
0 
2 








disturbance than either failure group on 
Scale I. Also, the short-success group shows 
a lesser degree of disturbance than either 
failure group on Scale VIII. 

Of special interest are results setting a 
given group apart from the other three groups. 
We reject sets of three null hypotheses if in 
each instance p < .10. On this basis, Table 4 
contains the following results: the short-suc- 
cess group shows less disturbance than any 
other group on Scales I and VIII; the short- 
failure group shows more disturbance than 
any other group on Scale IV; and the failure- 
zone group shows more disturbance than any 
other group on Scale VIII. 


Letter Ratings 


Table 5 presents the number of letter rat- 
ings agreed upon by both judges for four 
scales with respect to the four groups. 

The most interesting feature of Table 5 is 
that success groups appear to differ from fail- 
ure groups on Scales X, XI, and XII. While 
there were no differences on these scales as 
numerically rated for degree of disturbance, 


differences in typical mode of controlling 
inner disturbance were observed by both 
judges. 


Distinguishing Descriptions 


The results and observations mentioned in 
connection with Tables 4 and 5 are given 
embodiment in descriptions distinguishing the 
groups. Descriptions employ the language of 
those scales, and only those scales, for which 
either statistically significant differences or 
clear qualitative observations have been made. 

The short-success group shows a generally 
higher level of personality integration than 
that descriptive of other groups. In particu- 
lar, they are rather open to their impulse life 
(I), and, while showing some distortion and 
uncertainty, they are a good deal less con- 
fused about their sex role than other groups 
(VIII). 

Though similar to the short-success group 
in many respects, the long-success group 
shows a somewhat lower level of personality 
integration in general. In particular, they are 
rather anxious about their impulse life (I) 


Table 4 


Significance Levels of Differences Between Groups on Six Scales, and Means Indicating Direction of Differences 








Means of Groups? 


Scale SS LS SF 


FZ 





I. 2.00 
IV. 3.00 
VIII. 2.50 
X. 3.50 
XI. 3.75 
XII. 3.25 


3.00 
3.25 
3.75 
3.75 
4.50 
4.25 


4.10 
4.30 
3.80 
4.10 
3.90 
3.80 


4.43 
3.71 
4.57 
4.00 
3.57 
3.85 





* Two-tailed tests throughout. 


<.10 
>.20 
<.07 
>.20 
<.15 
<.19 





p of Differences* 


SS vs. LS SSvs.SF SSvs. FZ LS vs. SF LS vs. FZ SF vs. FZ 





<.01 
>.20 
<.10 
>.20 
<.08 
>.20 


<.01 
<.03 
>.20 
>.20 
>.20 
>.20 


<.01 
<.02 
<.02 
>.20 
>.20 
>.20 


<.01 
>.20 
<.02 
>.20 
>.20 
>.20 





» SS—short success; LS—long success; SF—short failure; FZ—failure-zone. 
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Table 5 


Letter Ratings Agreed on by Both Judges 


Short 
success 


Short 
failure 


Long 
success 


Failure 
zone 


abe ab 





‘ 2 
X. 1 
XI. 
XII. 


and rather confused about the meaning and 
place of sex in their identity (VIII). 

Both success groups differ from each failure 
group in showing less disturbance in their im- 
pulse life (1). Letter ratings show the success 
groups typically displaying a strong need for 
people to relate to in order to feel worthwhile 
(X), seeking internally for the cause and 
resolution of felt discomfort (XI), and rather 
strongly condemning, berating, and disvalu- 
ing themselves (XII). 

The short-failure group shows a generally 
low level of personality integration. Especially 
distinguishing is their extreme underlying 
sense of incapacity to deal with their life 
situations (IV). 

The failure-zone group also shows a gener- 
ally low level of personality integration. Es- 
pecially distinguishing is their extreme con- 
flictedness about and rejection of their sex 
role (VIII). 

Both failure groups show greater disturb- 
ance than either success group on several 
scales. They are extremely disturbed in their 
impulse life (1), with expressions frequently 
occurring in over-controlled or distorted mo- 
dalities, as indicated by letter ratings. Over- 
control seems somewhat more typical for the 
failure-zone group. Letter ratings show the 
failure groups typically looking outward for 
the cause and resolution of their discomfort 
(XI), and typically anticipating punishment 
from external sources (XII). 


Discussion 


The foregoing results seem unambiguously 
consistent with the general hypothesis moti- 
vating the study: that therapy length-by-out- 
come is related to the personality structure of 
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clients at the beginning of therapy. While 
caution must be invoked due to the small 
sizes of the groups studied, the results add to 
the growing body of knowledge about factors 
which favor or limit therapeutic change in 
client-centered therapy. Earlier work in this 
area has been summarized by Rogers (17, pp. 
423-424, 427-428). 

It may be noted that the present approach 
constitutes a synthesis of two trends in previ- 
ous work on the general problem. One trend 
has dealt with the relations between prether- 
apy characteristics of clients and eventual 
ratings of improvement or outcome (e.g., 1, 
2, 8, 11, 15, 19, 26). The other trend has 
dealt with the relations between pretherapy 
characteristics of clients and their length of 
stay in treatment (e.g., 7, 13, 14, 18, 20). 
To our knowledge, only one previous investi- 
gator, Dana (6), has attempted to predict im- 
provement and length of treatment jointly. 
Due to the findings of Cartwright (5) and 
Taylor (25), a fusion of the two main trends 
into systematic emphasis upon length-by-out- 
come groupings now seems necessary. More 
generally, it is necessary to predict both 
amount and rate of change constituting im- 
provement. While something of a beginning 
has been made in this direction by Dana’s re- 
sults and by those of the present study, much 
more needs to be done. 


Summary 


Twenty-six clients, who had completed a 
course of client-centered therapy at the Coun- 
seling Center, University of Chicago, were 
divided into five groups on the basis of length- 
by-outcome: 


Short success (1-12 interviews)—N equals 4 

Short failure (1-12 interviews)—N equals 10 

Failure-zone (13-21 interviews)—N equals 7 

Long success (more than 21 interviews)—N 
equals 4 

Long failure (more than 21 interviews)—N 
equals 1 


Twelve personality variables were selected, 
and 5-point rating scales were constructed to 
describe both intensity and mode of behav- 
ior with respect to each of the 12 variables. 
These scales were applied independently by 
two judges to the pretherapy Thematic Ap- 
perception Test protocol in conjunction with 
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the recorded or transcribed first therapy in- 
terview in each case. Interjudge agreement 
was deemed adequate throughout. 

The ratings made by an independent judge 
were intercorrelated, and the resulting matrix 
was inspected for clusters from each of which 
one scale was selected, yielding four relativ.'y 
independent scales. Two further scales were 
reported upon because, though substantially 
correlated with one of the four selected scales, 
they are uncorrelated with the others and 
point up interesting and distinctive differ- 
ences between client personality structure. 

It was thus found that therapy length-by- 
outcome is related to the personality struc- 
ture of clients at the beginning of therapy. 
The most marked differences found on these 
scales were those between success groups and 
the failure groups. 


Received July 31, 1957. 
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As a technique for producing changes in the 
personality structure of a patient, psycho- 
therapy does not exist apart from the person 
of the therapist. While it is axiomatic that the 
therapeutic instrument comprises the thera- 
pist, his personality, theory, and techniques, 
little systematic attention has been paid by 
researchers to these variables. 

In a sense, the therapist is a reader of im- 
aginary dials who acts “therapeutically” upon 
the information indicated by these measuring 
instruments within himself. When supplied 
with certain information, say, the behavior 
of a patient, he will register this information 
in certain ways, and he will react to it in cer- 
tain other ways. The therapist’s theory, ex- 
perience, and personality determine the kinds 
of dials he uses as well as their calibration. 
His therapeutic actions are their reflection. 

In order to obtain empirical data on the 
calibration of the therapist as a clinical in- 
strument, we can supply a certain identical 
input (patient behavior) and note (a) how 
this input is perceived and evaluated; (db) 
what deductions and inferences are drawn 
from it; and (c) how these processes are 
translated into therapeutic action. On the hy- 
pothesis that this testing procedure will dis- 
close differences among therapists, one can 


1 This paper was presented at the 1957 meeting of 
the Eastern Psychological Association. 

This research, part of a larger project supported 
by a research grant (M-965) from the National In- 
stitute of Mental Health, Public Health Service, was 
conducted while the author was affiliated with The 
George Washington University, School of Medicine. 

2 The author is indebted to Rebecca E. Rieger for 
her assistance and valuable comments. 


investigate systematic influences which might 
produce the discrepancies. For example, thera- 
pists may use different sets of measuring in- 
struments whose readings have no relation to 
each other. They may use similar instruments, 
but the calibration may differ. They may in- 
terpret differentially the meaning of the same 
pointer reading. Any one of these possibili- 
ties, plus many others, may result in divergent 
therapeutic communications. 

This analogy is of course a gross oversim- 
plification of the therapist’s activity in the 
therapeutic situation, but it may help to make 
explicit the objectives of the larger investiga- 
tion of which this paper forms a part. The 
purpose was to study the therapist’s perform- 
ance in a first interview and to ask: If a 
group of therapists observe the same patient, 
how do they perceive the clinical problem? 
What kinds of evaluations result from their 
perceptions? How are their perceptions and 
evaluations related to their communications? 
And, can differences in their performance be 
accounted for in terms of common underlying 
variables? Specifically, this paper is concerned 
with a comparison between psychotherapists 
(psychologists) following psychoanalytic prin- 
ciples and therapists following the client-cen- 
tered theory of Rogers. 


Method and Procedure 


To study the performance of psychothera- 
pists under comparable conditions, a sound 
film of a first interview was used as an ex- 
perimental model. The film was produced by 
the Veterans Administration for training pur- 
poses and was entitled “A Clinical Picture of 
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Claustrophobia.” * It was described as an un- 
rehearsed interview from which no significant 
portions had been deleted. The motion pic- 
ture was adapted for experimental purposes 
by introducing interruptions at 28 preselected 
points identified by a uniform title “What 
would you do?” followed by a number from 
1 to 28. 


Synopsis. The patient, a middle-aged man, com- 
plains of intense anxiety and gastrointestinal symp- 
toms which appear to be psychogenic. He relates an 
episode of a panic reaction on a crowded street car. 
Following the incident he found himself unable to go 
out of the house unless he was accompanied by his 
wife, an invalid of many years. As the interview 
progresses, he dwells on an intense conflict with his 
mother, a strong-willed woman, and speaks at length 
about his “bad luck” in all phases of his life. The 
father had died when the patient was young, leaving 
him with family responsibilities for which he was 
unprepared. He feels that “I had three strikes against 
me before I came to the plate.” He speaks freely, 
and his anger and resentment against his mother, 
brother, and people in general build up. The patient 
appears hostile, demanding, and dependent; he ver- 
balizes a feeling that the world owes him a living. 
While it is not clear that he had psychotherapy be- 
fore, he makes mention of previous interviews. He 
speaks of having obtained some “insight,” of which 
his rationalizations and projections give little evi- 
dence. The interview ends somewhat abruptly after 
he has made a passionate plea for help, couched in 
a demand for reassurance. 

The interviewer was an inexperienced resident in 
psychiatry who essentially permitted the patient to 
tell his story. He frequently repeated a word or 
phrase of the patient’s narrative but focused very 
little. At times, he asked informational questions 
which seemed to have the effect of diverting the pa- 
tient from an emotionally charged topic. The inter- 
viewer’s attitude may be described as benevolent but 
rather distant. He was passive, offered no reassur- 
ance and seemed ill at ease. No clues were given 
about the outcome of the interview. 


The film was shown to groups of therapists 
who were asked to assume that they were in- 
terviewing the patient, and to record in writ- 
ing their communications to the patient while 
the film sequence was interrupted. Each pause 
lasted 30 seconds, which provided most thera- 
pists ample but not excessive time to make 


8 The “Psychotherapeutic Interviewing Series” was 
directed by Jacob Finesinger and Florence Powder- 
maker. The film under discussion, Part V of the se- 
ries, was filmed in 1952. I wish to express my ap- 
preciation to the Psychiatry and Neurology Service 
of the Veterans Administration for granting permis- 
sion to use the film for experimental purposes. 
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their comments. Respondents were free to re- 
spond or to remain “silent.” The experimental 
version of the film ran continuously and lasted 
approximately 50 minutes. 

Following the film showing, therapists were 
requested to complete a comprehensive ques- 
tionnaire on diagnostic impressions, treatment 
plans and goals, formulations of the patient’s 
emotional dynamics, problems in treatment, 
estimates of the degree of any‘ety, emotional 
maturity, social adjustment, prognosis, the 
respondent’s attitude toward the patient, etc. 
In addition, there was a biographical infor- 
mation blank containing 24 questions about 
the therapist’s training, theoretical orienta- 
tion, and current therapeutic activities. 

The experimental procedure usually re- 
quired about two hours. 


The Samples 


Cooperation was obtained from psychiatric 
and psychological societies, hospitals, training 
centers, and therapeutic institutes. Data were 
collected in Washington, Baltimore, New 
York, Perry Point, and Chicago. The total 
sample comprised 235 psychotherapists, 134 
of whom were members of the medical pro- 
fession (analysts, psychiatrists, and residents 
in psychiatry); 17 were psychiatric social 
workers. This paper is concerned with the 
responses of 14 psychologists who described 
their theoretical orientation as unqualifiedly 
Rogerian (8 were affiliated with the Counsel- 
ing Center of the University of Chicago). 
This group is compared to 64 psychologists 
who professed to follow psychoanalytic prin- 
ciples broadly defined. Actually, they selected 
a total of 93 “labels,” which were distributed 
over 14 categories; 72% of the choices were 
accounted for by: psychoanalytic orientation 
(general or orthodox), Sullivan, and eclectic.‘ 

Summary statistics on age, experience, and 
personal analysis are presented in Table 1. 
The following data shed some further light on 
the composition of the samples: 


4 Grateful acknowledgment is made to all psycholo- 
gists who gave generously of their time to make this 
study possible. Thanks are also due the following or- 
ganizations: VA Benefits Office, Washington, D. C.; 
VA Hospital, Perry Point, Md.; Maryland Psycho- 
logical Association; Postgraduate Center for Psycho- 
therapy, New York, N. Y.; Counseling Center, Uni- 
versity of Chicago. 
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Table 1 


Summary Statistics of Age, Experience, 
Persona! Analysis 


Sample R® 


Statistic Ne Mean } N SD 





Age 65 35.6 yrs. 13 334yrs. 64 


Experience 


intherapy 62 3.3 yrs. ; 12 30yrs. 24 


Personal 


analysis 33 291hrs. 198 9 142hrs. 





* Psychoanalytically oriented psychologists. 

b Rogerian psychologists. 

* Fluctuations reflect absence of complete information and/or 
not having had personal analysis. 


Two-thirds of the psychoanalytically ori- 
ented psychologists (Sample A) held the 
Ph.D. degree, as compared with 36% of the 
psychologists who followed Rogers’ theory 
(Sample R). 

Sixty per cent of the members in both 
groups said they had a fair amount or a 
great deal of experience in therapy and con- 
sidered themselves competent therapists. All 
members of Sample R said they practiced 
client-centered therapy exclusively. Sample A 
showed a wider distribution, but most replies 
fell within the broad category of psychoana- 
lytically oriented psychotherapy. Most Roger- 
ians indicated that they had received very 
little training in psychoanalytic principles 
and techniques; 69% of Sample A claimed 
“a fair amount” or “a great deal.” 

Sixty-four per cent of Sample R had re- 
ceived their personal analysis within the 
client-centered framework. The responses of 
members of Sample A who indicated that they 
either had completed or were in the process of 
undergoing personal analysis (more than half 
the group) suggested that their analyses were 
based upon psychoanalytic theory. 

Approximately one-half of the members in 
both groups stated that they devoted more 
than 25% of their time to the practice of psy- 
chotherapy. Almost two-thirds of Sample R 
specified their therapy as being primarily with 
neurotic patients. Thirty-eight per cent of 
Sample A responded similarly, but another 
third said that their patients represented a 
wider range of personality disorders. 
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Sample A appeared to treat their “average 
patient” for longer periods than Sample R: 
38% of the former group described the length 
of therapy as exceeding one year, as con- 
trasted with 14% in the latter. On the other 
hand, Rogerians predominantly saw the “av- 
erage patient” twice a week, whereas the 
modal frequency for Sample A was closer to 
one weekly session. 


Results 


Therapists’ responses to the film, that is, 
communications to the patient, were quanti- 
fied by a multidimensional system of analysis 
which has been described previously (11). 
Questionnaire responses to each item were 
coded and the frequencies tabulated. In view 
of the small size of Sample R, response cate- 
gories within an item had to be combined. 
Typically, this resulted in 2 x 2 tables which 
were tested by means of chi square (corrected 
for continuity).° 


Clinical Evaluations and Therapist Attitudes 


Responses to the question of diagnosis re- 
vealed no significant differences. It seems note- 
worthy, however, that no member of Sample 
R chose such labels as psychopath, schizoid, 
paranoid, or character disorder, which were 
used by appreciable minorities of Sample A, 
as well as by psychiatrists whose response 
distributions have been presented elsewhere 
(12). Asked to state the patient’s major de- 
fense mechanisms, Sample R named a rela- 
tively smaller number than Sample A (ratio 
of 1:3), but the distributions did not appear 
to differ significantly. Nor did the samples 
differ in terms of qualitative descriptions of 
the patient’s emotional dynamics. 

Evaluations of the degree of ego strength, 
anxiety, insight, social adjustment, and dis- 
turbance disclosed no statistically significant 
differences. 

The members of Sample R judged the prog- 
nosis with therapy to be more favorable than 
the members of Sample A (p = < .05): no 


5 A table showing the response distributions of the 
two samples in percentages and giving probability 
values for p= <.10 has been deposited with the 
American Documentation Institute. Order Document 
No. 5587, remitting $1.25 for microfilm or $1.25 for 
photocopies. 
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Rogerian therapist considered it unfavorable. 
By contrast, an unfavorable prognosis was 
given by 40% of the therapists in Sample A, 
who were in closer agreement with the predic- 
tions made by psychiatrists. 

A larger proportion of Rogerian therapists 
professed a positive attitude toward the pa- 
tient than analytically oriented psychologists 
(p = < .02). Members of the University of 
Chicago Counseling Center unanimously de- 
scribed their attitude as positive. Those re- 
sponding negatively often stated that their 
attitude was influenced by the patient’s hos- 
tility, anger, and dependency; those respond- 
ing positively stressed his personality assets 
or saw him as “a patient with a problem,” a 
response often given with some empathy. 


Plans, Goals, and Difficulties in Therapy 


Sample R showed greater reluctance to set 
up therapeutic goals than Sample A (p = < 
.001), but the two samples did not differ sig- 
nificantly as to estimated length of treatment 
or time needed to accomplish symptom relief. 
It seems pertinent to note that psychoana- 
lysts were less optimistic than either of the 
psychologist groups. 

Concerning difficulties expected in treat- 
ment, Sample R therapists were significantly 
less likely to mention: the patient’s parasitic 
attitudes, poor motivation for therapy, acting 
out, or the possibility of psychosis (p = < 
.02). Nor were countertransference or prob- 
lems of technique mentioned with any degree 
of frequency by Sample R (p = < .01). One- 
half of Sample R anticipated countertransfer- 
ence problems with this patient, as compared 
with almost 80% in Sample A; twice as many 
Rogerians answered the question in the nega- 
tive (p= < .07). No Rogerian mentioned 
areas in the patient’s living which should be 
avoided, at least early in therapy, whereas 
numerous areas were mentioned by Sample A 
therapists (p = < .001). Similarly, Rogerians 
were less likely to name areas in the patient’s 
living which should be singled out for focus 
in therapy (p = < .001). 

Rogerians either declined to specify atti- 
tudes or behaviors which the therapist should 
encourage in therapy with his patient or 
stressed the expression of feelings. Sample A 
therapists were more likely to stress: a sense 
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of responsibility, increased socialization, relat- 
ing feelings and symptoms to interpersonal 
situations, etc. (p = < .02). 

Conversely, Sample A therapists were more 
likely to discourage attitudes and behaviors, 
such as: intellectualization, obsessive rumina- 
tions, self-pity, self-depreciation, helplessness, 
refusal to accept responsibility, demanding 
attitudes, and acting out; Rogerians, by com- 
parison, tended to say they would discourage 
nothing or leave it to the patient (p= < 
001). 

No member of the Rogerian group advo- 
cated strictness by the therapist, as contrasted 
with more than a third of the members of 
Sample A who considered strictness thera- 
peutically desirable (p= .02). A question 
about therapist’s activity or passivity did not 
differentiate the two groups, possibly because 
the terms of reference were not clearly de- 
fined. Free association was discouraged by an 
appreciable proportion of Sample A thera- 
pists, but not by Sample R (p = < .06). 

A question was asked about clues in the 
patient’s verbal or nonverbal behavior which 
might not be used immediately but kept in 
mind for possible later reference. All mem- 
bers of Sample A who answered the question 
(85%) mentioned one or more clues. Four 
Rogerians specifically stated “none” (p = < 
.001). Rogerians commented about the pa- 
tient’s feelings and attitudes but, in contrast 
to Sample A, paid less attention to such clues 
as: gestures, bodily movements, manner of 
speaking, the patient’s past or present inter- 
personal relations with his mother, wife, 
brother, or father—with or without inference 
about their dynamic significance. 

Descriptions of the course of the transfer- 
ence did not diverge significantly, but mem- 
bers of the two samples disagreed about ap- 
proaches to the problem (p = < .001). Rog- 
erians predominantly encouraged or fostered 
a “corrective emotional experience” by con- 
veying respect or clarifying feelings. Members 
of Sample A, on the other hand, tended to ad- 
vocate interpretation and other procedures, 
such as reassurance and firmness, in which 
the therapist uses his role to induce changes 
in attitudes and actions. 

Responses to a question dealing with “act- 
ing out,” an aspect of the transference prob- 
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lem, revealed a similar pattern: no statisti- 
cally significant differences with respect to its 
occurrence, but divergence in terms of “han- 
dling.” Here, too, Rogerians recommended 
understanding, clarification, and reflections; 
analytically oriented therapists preferred an 
interpretive approach (p = < .001). 


Evaluation of and Attitude Toward the Film 
Therapist 


A number of questions were included to 
study similarities or differences between the 
film therapist’s and the respondent’s ap- 
proaches to interviewing. It was hoped that 
in this way it might be possible to obtain a 
clearer picture of the respondent’s own pro- 
cedure. 

Sample R therapists were more definite in 
their assertion that they would have con- 
ducted the interview “in a very different man- 
ner” (p= < .02). They dissociated them- 
selves from his approach and tended to evalu- 
ate his performance as “a very inadequate 
job”; the majority of therapists in Sample A 
considered his performance reasonably ade- 
quate (p= < .01). 

Within each sample, there was notable dis- 
agreement as to whether the therapist had 
been too passive or given insufficient support, 
but the intersample comparisons were non- 
significant. 

Rogerians and psychoanalytically oriented 
therapists disagreed very markedly about the 
amount of time they would have devoted to 
obtaining data on the patient's life history: 
75% of Sample R stated that they would have 
spent somewhat less or considerably less time 
than the film therapist; almost one-half of 
Sample A said they would have devoted some- 
what more or considerably more time (p = 
< .001). The transcript shows that about 
10% of the therapist’s interventions can be 
construed as case history questions in the 
usual sense. These questions were usually 
asked in the context of the patient’s story, 
and attempted to clarify a time, place, or in- 
terpersonal event. On the other hand, the 
therapist frequently repeated a word or phrase 
in a questioning tone of voice, apparently to 
obtain further data on a feeling or attitude. 

The two samples revealed a statistically 
highly significant difference in terms of their 


269 


attitude toward the therapist. The majority of 
Rogerian therapists described their attitude as 
negative, whereas most analytically oriented 
therapists professed a positive or neutral atti- 
tude (p = < .001). 

A supplemental question disclosed that the 
respondents’ criticisms were directed at the 
film therapist’s technique as well as at his 
person. Prominently mentioned by both sam- 
ples were the following objections: Therapist 
was not in control of the situation, engaged 
in too much activity but not of the “right” 
kind or not at the “right” time, too much in- 
terference with patient’s talk, too much “‘ac- 
centing” (repeating last word), wrong em- 
phasis, no plan. Rogerians criticized him es- 
pecially for failing to focus on the patient’s 
feelings and attitudes, and for a lack of em- 
pathy. 


Interviewing Techniques 


Analyses of therapists’ communications to 
the patient disclosed that Rogerians strongly 
favored the reflection-of-feeling technique, 
which comprised 67% of their responses. The 
comparable percentage for analytically ori- 
ented psychologists was 17. The technique 
preferred by Sample A consisted of explora- 
tory questions (40%). The number of “si- 
lent” responses given by Sample R was sig- 
nificantly smaller than that of Sample A. 
Communications showing a high degree of in- 
ference or initiative were rare in Sample R; 
Sample A had 3 to 8 times as many responses 
in these ranges. 

As might be expected, Sample R therapists 
adhered quite consistently to the reflection- 
of-feeling technique, even when the patient 
pleaded for a direct answer from the thera- 
pist. Many analytically oriented therapists 
responded to this plea with a direct answer. 


Discussion 


In interpreting these findings it must be 
kept in mind that the sample of Rogerian 
therapists was small. This deficiency was 
partly mitigated, however, by the fact that 
this group was compared with a considerably 
larger sample of analytically oriented thera- 
pists to whom they were roughly comparable 
in terms of training and level of experience. 
Furthermore, on a number of questionnaire 
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items, Sample R revealed almost complete 
unanimity which undoubtedly reflected homo- 
geneous training in a single setting (The Uni- 
versity of Chicago Counseling Center); Sam- 
ple A, on the other hand, was made up of 
therapists from a variety of backgrounds. It 
may be noted that psychoanalysts (members 
of the American Psychoanalytic Association) , 
whose training might be considered reason- 
ably homogeneous, varied more widely in 
their responses. 

The results showed a number of pronounced 
differences in approach which highlighted dis- 
crepancies between therapists subscribing to 
psychoanalytic principles and the client-cen- 
tered framework. Both positions are too well 
known to require exposition in this context. 
It seems appropriate, however, to quote what 
Rogers considers to be the distinctive empha- 
ses of client-centered therapy: 


A second distinctive aspect [the first is the em- 
phasis on research] is the deep confidence in the ca- 
pacity of the individual, which client-centered thera- 
pists have found to be justified by their experience. 
Rather than seeing man as essentially a destructive 
animal [the reference is evidently to the death in- 
stinct postulated by Freud late in his career, one of 
the most disputed and least accepted tenets in con- 
temporary analytic thinking], with the primary prob- 
lem of the control of his impulses, we have found 
man to be essentially constructive, if he can be re- 
leased from his defensiveness. Our experience is that, 
to the extent that he can fully accept his experience 
and himself, he becomes satisfactorily self-controlled. 
Our view of the basic nature of man differs sharply 
from those of most therapeutic orientations, which 
believe that guidance and control must be arranged 
by the expert. 

A third point of difference relates to a difference 
in theory. The theory of therapy which is developing 
in client-centered therapy stresses the importance of 
the immediate moment of experience in the relation- 
ship, not the genetic causation of behavior in the 
client, nor the formulation of some theoretic picture 
in the mind of the therapist (9, p. 207) 


® It is difficult to see how the terms “guidance and 
control,” in the sense that Rogers seems to use them, 
do in fairness characterize the mainstream of con- 
temporary psychoanalytic thinking. A semantic con- 
fusion seems to arise from the conception that the 
psychotherapist who functions in the role of a pro- 
fessional expert ipso facto manipulates the patient or 
controls him, and that guidance in the educational 
sense is synonymous with authoritarianism. Compare, 
in this connection, the formulations of a prominent 
psychoanalyst regarding therapeutic goals and the 
part played by the therapist: 

Treatment, of course, is aimed at the solution of 
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Rogers’ conception of the “ideal” therapist 
is that of a participant in an interpersonal 
situation whose primary function is to empa- 
thize with and understand the patient within 


the patient’s difficulties in living and the cure of his 
symptomatology. Ideally these therapeutic goals will 
be reached by the growth, maturation, and inner in- 
dependence of the patient... . This goal will also 
be actualized by the development of his capacity for 
self-realization, his ability to form durable relation- 
ships of intimacy with others, and to give and ac- 
cept mature love (3, p. 34). 

. Security and inner independence of the au- 
thoritarian values attributed to the conventional re- 
quirements of our culture are indispensable for the 
therapist who wants to guide his patients successfully 
toward finding out about the degree of cultural ad- 
justment which is adequate to their personal needs 
(3, p. 33). 

The psychiatrist’s respect for his patients will also 
help him to safeguard against the previously men- 
tioned mistake of assuming an attitude of personal 
“irrational authority” instead of listening and con- 
ducting therapy in the spirit of collaborative guid- 
ance. This irrational authoritarian behavior will be 
harmful not only because it interferes per se with 
the patient’s tendency toward growth and matura- 
tion but also, and more important, because it consti- 
tutes a traumatic repetition of the authoritarian as- 
pects of the cultural pattern of behavior in general 
and of the parental pattern in particular, to which 
most mental patients have been harmfully subjected 
in their past (3, p. 17). 

In his third point of difference, Rogers asserts that 
the experiential element in the psychotherapeutic 
situation is more important than the formulation of 
some theoretic picture in the mind of the therapist. 
This statement carries with it the implication that 
the therapist cannot adequately empathize with the 
patient, thereby helping to actualize the immediate 
moment of experience, if he attempts to understand 
rationally what is going on. Rogers has elaborated on 
this alleged dilemma in another paper (8). I, for 
one, know of no evidence that the therapist’s private 
thoughts about the nature of the patient’s emotional 
problem, its severity, etiology, etc., in and of them- 
selves preclude or interfere with empathy, nor that 
the exclusion of the therapist’s rational processes as- 
sures it. I feel that any psychotherapist does more 
than empathize with his patient: whether he inter- 
prets a feeling or attitude in terms of current or past 
interpersonal relationships, or whether he selects a 
particular feeling or attitude for reflection, he seems 
to follow a design for action dictated by his theory. 

Psychotherapy based on psychoanalytic principles 
does differ from other forms of psychotherapy in its 
emphasis on giving the patient a rational under- 
standing of his interpersonal processes as they emerge 
in the therapeutic relationship. The emotional experi- 
ence remains uppermost, but, in addition, the patient 
gains a conscious appreciation of his own contribu- 
tion to his difficulties in interpersonal relations. 
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his (the patient’s) frame of reference. From 
this vantage point, the therapist is imposing 
an external frame of reference if he attempts 
to diagnose, assess, or prognosticate. The same 
dictum would apply to hypotheses or hunches 
the therapist may have about the etiology of 
the patient’s conflict or how best to help him 
through psychotherapy. By this definition, the 
therapist who explicitly or implicitly goes be- 
yond the patient’s phenomenological self-de- 
scriptions is not client-centered. 

Certain objections voiced by Rogerian thera- 
pists regarding the experimental procedure are 
understandable from this point of view. While 
they cooperated willingly, they made it clear 
that ordinarily they do not work this way. 

On this basis, too, the following results are 
consistent with the client-centered viewpoint: 
a reluctance to make a diagnosis beyond the 
general label “anxiety,” to formulate thera- 
peutic goals, to mention dangers that might 
be encountered in therapy or cautions that 
would have to be observed, to encourage or 
discourage specific attitudes or behaviors in 
the patient, to treat him with strictness, to 
discourage the free expression of feelings 
(free association), or generally to consider 
problems of technique or “handling.” Rog- 
erians were also disinclined to make inferences 
about the patient’s behavior in the interview 
situation, the etiology of his disturbance, or 
the dynamic import of his behavior. Finally, 
they de-emphasized case history questions. 

One of the sharpest discrepancies between 
client-centered and analytically oriented thera- 
pists related to the question of prognosis: no 
Rogerian gave an unfavorable estimate, where- 
as 40% of Sample A took a pessimistic view. 
How can this finding be explained? 

First, it must be considered that our knowl- 
edge of neurotic disorders and psychotherapy 
is not sufficiently developed to permit specific 
predictions about the outcome of treatment 
even when a great deal is known about the 
patient. Such predictions undoubtedly become 
increasingly accurate as therapy progresses, 
and the therapist comes to learn more about 
the nature of the conflict, its ramifications, 
its history, and the patient’s current level of 
functioning. Thus, a prognostic estimate based 
on a single interview may be little more than 
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a guess, which may reveal more about the 
therapist than about the patient. 

Second, the key issue in this comparison 
seems to be the high degree of uniformity 
among Rogerians which was not paralleled by 
the analytically oriented therapists whose re- 
sponses appeared to be more “normally” dis- 
tributed. If therapists were talking about a 
specific event, the predictions of some obvi- 
ously must be right and the predictions of 
others must be wrong. Unquestionably, thera- 
pists were not talking about a specific event 
but about a therapeutic outcome concerning 
which a wide range of ideas and opinions were 
entertained. 

In an attempt to shed some light on this 
result, I propose to examine the following al- 
ternatives: (a) Rogerians gave a more favor- 
able prognosis because they perceived the pa- 
tient as less disturbed; (5) Rogerians planned 
for more modest therapeutic goals and there- 
fore gave a more optimistic prognostic esti- 
mate; (c) prognostic estimates may be rela- 
tively unrelated to the presenting problem or 
the therapist’s past successes with similar pa- 
tients, but reflect more directly the thera- 
pist’s attitude, his theoretical orientation, or 
combinations of these. 


(a) The available evidence indicates that thera- 
pists in both samples tended to regard the patient 
as seriously disturbed. They were also reasonably 
agreed that he suffered a great deal of anxiety, that 
he was emotionally immature, socially maladjusted, 
and that he had relatively little insight. There is no 
reason to believe, therefore, that Rogerians judged 
the patient’s difficulties as less severe 

(b) It is difficult to judge similarities or differ- 
ences in therapeutic goals because of the vagueness 
of the terms of reference. Rogerians tended to ad- 
vocate greater self-acceptance; analytically oriented 
therapists tended to strive for some insight. How- 
ever, the latter group generally did not recommend 
the more intensive forms of psychotherapy usually 
associated with psychoanalysis. On this basis, it 
seems fair to say that the Rogerians’ more favorable 
prognostic estimates were not a function of more 
limited therapeutic goals. 

(c) It has been reported previously (12) that 
there was a highly significant statistical relationship 
between psychiatrists’ prognostic estimates and their 
self-ratings of their conscious attitude toward the pa- 
tient (r = 39). For psychoanalytically oriented psy- 
chologists, the correlation was r= 42. Among Rog- 
erians, the invariance between attitude and prog- 
nosis was almost complete. Eight therapists at the 
University of Chicago Counseling Center expressed 
a positive attitude and rated the prognosis favorable. 
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In the case of psychiatrists, it was found 
that both attitude and prognosis were signifi- 
cantly associated with other clinical evalua- 
tions and treatment plans. In addition, thera- 
pists’ attitudes tended to be reflected in the 
emotional tone and the degree of empathy 
of their communications. These findings sug- 
gested the interpretation that clinical assess- 
ments tended to reflect, in part at least, the 
therapist’s emotional reaction to the patent.’ 
On the other hand, if the therapist had un- 
dergone personal analysis, his empathy tended 
to be greater irrespective of his conscious atti- 
tude. In other words, personal analysis seemed 
to introduce some distance between a nega- 
tive reaction to the patient and the therapist’s 
ability to empathize with him. It is possible, 
then, that the therapist’s conscious attitude 
toward the patient is less important than what 
he does about it.* 


7 It seems reasonable to assume that clinical evalua- 
tions, treatment plans, and goals are a function of 
the diagnosis. Comparisons of therapists (based on 
the total! sample) giving a “pure” diagnosis of Hys- 
teria, Obsessive-compulsive, Paranoid, and Anxiety 
showed a number of statistically significant differ- 
ences in terms of clinical evaluations, etc. (12). It 
was pointed out in the same connection that the 
choice of certain diagnostic labels (paranoid, psycho- 
path) may signify a subtle value judgment which in 
turn may be determined by the therapist’s attitude 
toward the patient. The data seemed to favor this 
conclusion. 

8 It should be recalled that the operational defini- 
tion of therapist’s conscious attitude toward the pa- 
tient, as used in this investigation, was a self-rating 
on a five-point scale. Apart from considerations of 
reliability, this rating was undoubtedly influenced by 
therapists’ willingness to express their genuine con- 
scious attitude, unconscious distortions, etc. It is 
difficult to say to what extent such ratings repre- 
sent countertransference reactions even if there were 
greater agreement about the term (5). It seems that 
Gitelson’s (4) description of reactions to the patient 
as a whole, which he terms transferences of the ana- 
lyst as distinguished from reactions to partial aspects 
of the patient (countertransference), fits the present 
ratings reasonably well: 

It is my impression that total reactions to a pa- 
tient are transferences of the analyst to his patients 
and are revivals of ancient transference potentials. 
These may be manifested in the over-all attitude to- 
ward patients as a class or may exacerbate in the 
“whole response” to particular patients. These atti- 
tudes may be positively or negatively toned. They 
are likely to manifest themselves ea:ly in the contact 
with a patient and determine the tendency of the 
analyst towards the whole case (4, p. 4). 
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To integrate a workable therapeutic rela- 
tionship, the therapist, according to psycho- 
analytic theory, must convey an attitude of 
acceptance, patience, respect, etc. He must 
also be able to empathize (1, p. 580). This 
does not mean that he must have or should 
have a positive attitude toward all aspects of 
the patient’s behavior. Rather, it is recog- 
nized that the therapist, like other people, 
may experience emotional reactions. However, 
he is enjoined to be vigilant to make sure that 
they do not interfere with the therapeutic 
process and if they do, to seek clarification 
through self-analysis or other means. 

The emphasis on diagnosis and etiology, ac- 
cording to analytic theory, determines at least 
partially the therapist’s approach to therapy. 
Reich says: 


It should be clear that one approaches an aggres- 
sive patient unlike a masochistic one, a hyperactive 
hysteric unlike a depressive one, that one changes 
one’s attitude to one and the same patient according 
to the situation, that, in brief, one does not behave 
neurotically oneself, even though one may have to 
deal with some neurotic difficulties in oneself. 

One cannot give up one’s own individuality, a fact 
which one will consider in the choice of patients. 
But one should be able to expect that this indi- 
viduality is not a disturbing factor and that the 
training analysis should establish the necessary mini- 
mum in plasticity of character (6, p. 139). 


From these considerations it would follow 
that, ideally, the prognosis for any one pa- 
tient represents a realistic assessment of the 
severity of the disturbance reflecting clinical 
experience regarding the response of similar 
patients to psychotherapy. 

While evidence has been presented (12) 
that a prognostic estimate may be based on 
the therapist’s emotional reaction to the pa- 
tient rather than on the clinical evidence, it 
does not follow that an unfavorable prognosis 
is ipso facto proof of a rejecting attitude on 
the therapist’s part. It may well be a realistic 
assessment of the clinical picture. 

It appears that the results obtained from 
client-centered therapists must be explained 
on different grounds. First, one of the most 
important conditions for constructive person- 
ality change postulated by Rogers (10) is the 
therapist’s “unconditional positive regard” for 
the patient. This means a “prizing” of the 
person, the absence of conditions of accept- 
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ance and of a selective evaluating attitude. 
Unconditional positive regard appears closely 
related to empathy, another of Rogers’ key 
postulates, which is defined as the therapist’s 
“experiencing an accurate, empathic under- 
standing of the client’s awareness of his own 
experience” (10, p. 99). 

Second, in explicitly rejecting the usefulness 
of diagnosis for psychotherapy, Rogers denies 
that the therapist’s approach or operations 
should be influenced or modified by the type 
of patient or disturbance (10, p. 101). This 
would also imply that prognostic estimates, 
to the extent that they attempt to take into 
account the severity or chronicity of the pa- 
tient’s disturbance, are pointless. 

Thus, it appears logical that the client-cen- 
tered therapist who fully accepts the theory 
must (a) describe his attitude toward the pa- 
tient as positive and (b) give a favorable 
prognosis. The ratings under discussion seem 
to reflect very clearly the respondents’ theo- 
retical orientation and are perhaps best ex- 
plained on this basis. 

Whether a conscious attitude toward the 
patient is a more or less individual expres- 
sion of the vierapist’s emotional reaction or 
whether the attitude is an intrinsic part of 
the therapist’s theoretical framework, or both, 
the more important question relates to the 
manner in which this attitude is expressed in 
the therapeutic situation. Some evidence (12) 
has been adduced that there is in fact a carry- 
over. It remains to be demonstrated to what 
extent the therapist’s initial attitude toward 
the patient influences the course and outcome 
of therapy, but there seems to be a very real 
possibility that the therapist making a nega- 
tive evaluation and/or reacting negatively to 
the patient may unwittingly contribute to the 
realization of the events he diagnoses or pre- 
dicts. Contrariwise, the therapist who brings 
a genuinely positive approach to bear upon 
the relationship may thereby promote a more 
favorable outcome.’ Both psychoanalytic and 
client-centered theory emphasize that the per- 
son of the therapist, as revealed in this rela- 


If it turns out that the therapist’s personality— 
his dedication, faith, belief—is indeed a primary 
force in modern psychotherapy, painstaking research 
may still give a clearer picture of the components. 
(See in this connection [2].) 
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tionship, will in time serve as a new, more 
mature, and more desirable model of reality 
than past interpersonal relationships which 
have distorted the patient’s perceptions of 
himself and others (13): 


What is curative in the [therapeutic] process is 
that in tending to reconstruct with the analyst that 
atmosphere which obtained in childhood, the patient 
actually achieves something new. He discovers that 
part of himself which had to be repressed at the 
time of the original experience. He can only do this 
in an interpersonal relationship with the analyst, 
which is suitable to such a rediscovery. . . . Thus, 
the transference phenomenon is used so that the pa- 
tient will completely reexperience the original frames 
of reference, and himself within those frames, in a 
truly different relationship with the analyst, to the 
end that he can discover the invalidity of his con- 
clusions about himself and others (7, p. 151). 


Rogers puts it this way: 

In a minimal way, the client may perceive from 
the first the unconditional positive regard of the 
therapist for him. The perception is continually 
strengthened as he discovers that each facet of him- 
self which is exposed—contradictions, weaknesses, 
strengths, abnormal feelings, tender feelings, vicious 
attitudes, antisocial behaviors, fears and despairs— 
are all met with equal positive regard, because each 
of these elements is a part of him, and he is prized 
unconditionally. The client then gradually takes in 
the experience of being loved—a love which is non- 
possessive and nondemanding, warm but not over- 
solicitous. As he lives for a sequence of hours in this 
atmosphere, the experience has two primary results. 
It permits him to relax the tight defensive structure 
of his concept of himself and to admit into aware- 
ness and fully experience attitudes which previously 
he had found too threatening . In the second 
place, the attitude of the therapist toward him is 
gradually internalized so that he can take the same 
attitude toward himself. He comes to prize himself, 
to feel that he is of value (9, p. 204) 

Much remains to be learned about the 
therapist’s personality and its contribution to 
the treatment process, but, as I have sug- 
gested elsewhere (11), this focus on the per- 
son of the therapist may be one of the most 
fruitful avenues for research in this area. 


Summary 

This investigation was designed to obtain 
empirical evidence on psychotherapists’ per- 
formance in an initial interview for the pur- 
pose of studying the influence of relevant 
therapist variables. 

Psychotherapists were asked to respond as 
vicarious interviewers to a sound film of an 
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unrehearsed initial interview. Data were col- 
lected from 235 psychotherapists in several 
cities. The present paper, part of a larger 
project, was concerned with a comparison be- 
tween 64 psychoanalytically oriented thera- 
pists and 14 therapists following Rogers’ 
client-centered theory. All respondents were 
psychologists, and 8 Rogerians were staff 
members of the University of Chicago Coun- 
seling Center. 

Statistical comparisons of questionnaire re- 
sponses to a series of items dealing with diag- 
nosis, prognosis, treatment plans, and _ atti- 
tudes disclosed a number of significant differ- 
ences between the two samples. Rogerians 
were generally disinclined to make plans for 
treatment or to set up therapeutic goals. 
Their prognostic estimates were more favor- 
able, and they rated their attitude toward the 
patient as more positive than analytically ori- 
ented therapists. 

In discussing the implications of these find- 
ings, attention has been drawn to the inter- 
relationships between therapist’s attitude to- 
ward the patient and clinical evaluations, in- 
cluding diagnosis and prognosis. In view of 


the self-realizing character of one’s expecta- 
tions, the therapist’s attitude toward the pa- 
tient may have important effects upon the 
course and outcome of psychotherapy. 

This investigation has stressed the great 
need for research on the therapist’s contribu- 
tion to the treatment process. 


Received August 9, 1957. 
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As Freud’s systematic thinking progressed, 
aggression emerged from a position of relative 
obscurity in 1905 into one of decided promi- 
nence in 1930. At first, aggressive impulses 
were regarded as derivatives or components 
of sexuality (10). Later Freud contended that 
each instinct had its own power of becoming 
aggressive, deprivations being especially pro- 
ductive of aggressive responses (9). 

Freud’s formulation of the death instinct 
(6, 7) next led him to assert that a portion 
of that instinct, directed toward the outer 
world, was manifested as an aggressive in- 
stinct. However, it was not until 1930 (8) 
that a substantial body grew onto the theo- 
retical framework of aggression. A desire for 
aggression was then said to be an independ- 
ent, innate drive. This instinctual disposition 
varied in intensity from man to man. Libido 
and aggression were viewed as primary in- 
stincts entwined with strivings toward life 
and death (6). 

The relationship between aggression and 
deprivation was experimentally examined by 
learning theorists and found to be relevant. 
Consequently, learning theory has given at- 
tention to aggression. At first it was assumed 
that frustration was a direct antecedent of 
aggression (2). As the investigations pro- 
ceeded, it was realized that frustration could 
give rise to a variety of responses, one of 
which could be some form of aggression (17). 
Aggression was viewed as a secondary, rather 
than as a primary, drive because it did not 
seem to involve any biological changes in the 
organism. Instead, aggression was held to be 


a simple, learnable response, subject to modi- 
fication. The selective agent in the learning 
of adult aggressive habits was reinforcement. 
The position of aggression in the hierarchy 
of responses to frustration depended upon 
whether past conditions were such as to re- 
inforce aggression as a response to external 
cues. Despite a continued emphasis on the 
importance of learning in aggression, learning 
theorists no longer deny the role of innate 
factors (3). 

There is an interesting parellel here. Two 
powerful psychological viewpoints seem to 
have gradually admitted aggression into more 
and more of their hypothetical realm. Such a 
similar change in two independent systems of 
thought may foreshadow the eventual impor- 
tance of aggression in other psychotherapeutic 
theories. The theoretical development of ag- 
gression indicates it would be worthwhile to 
consider its possible importance to a psycho- 
therapeutic theory that does not at present 
include it. 


Aggression and Client-Centered Theory 


One of the newest of psychological systems, 
the nondirective or client-centered approach, 
has not directly incorporated aggression. The 
fourth proposition of Roger’s formulation of 
personality is concerned with “the tendency 
of the organism to behave defensively in the 
face of threat, to achieve the goal of self- 
maintenance even when the usual pathway to 
that goal is blocked” (22, p. 488). However, 
he included these tendencies, which might 
sound like aggression, with food assimilation 
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and a general striving toward maturation, the 
tendency to which all organic and psychologi- 
cal needs might be ascribed by nondirectivists. 
If the importance achieved by aggression in 
the two concepts previously discussed is a 
proper forecast, then it appears likely that it 
is through this proposition that aggression 
will enter nondirective personality theory. 


Age and Aggression 


In both psychoanalytic and learning theory 
the appearance and expression of aggression 
are Closely dependent upon both maturational 
and experiential factors which follow a fairly 
close chronological sequence. To psychoana- 
lysts, genetic factors must be considered in 
the establishment of libidinal development 
and characteristic defenses. It would be clini- 
cally incorrect for them to assume exclusively 
that an aggressive child is reacting constantly 
to external frustration and to exclude the con- 
siderations of inner sources such as ego dis- 
turbances and libidinal factors. 

In analytic work with children, aggressive 
qualities reach their peak with the anal stage 
and become especially evident also during the 
Oedipal period (5). When the child reaches 
the stage in which his ego or superego disap- 
proves of aggression turned outward, aggres- 
sive impulses will be directed to substitute ob- 
jects, or modified by restriction, sublimation, 
and internalization (12). 

The learning theorists explain these changes 
on another basis. Both schools would prob- 
ably agree that the child is better able to 
handle instinctual drives by the acquisition 
of language. Psychoanalysts, however, see lan- 
guage integration as enabling ego mastery to 
proceed, while the learning theorists see it as 
an example of further learning which enables 
earlier, more diffuse, responses to be elimi- 
nated and more selective responses, such as 
verbal aggression, to be utilized. This verbal 
aggression, however, is subject to modification 
by reinforcement. As the child grows older 
and reactions with adults become important 
to him, his pattern of aggression is modified. 
Progression from infantile diffusion to di- 
rected attacks, to verbal specialization, to a 
marked reduction seems to be the behavioral 


Dell Lebo 


pattern of aggression expression in normal 
children. 

As a child grows chronologically, certain 
changes occur which influence the pattern of 
aggression. Whether these changes are called 
a developing superego or a modification 
through learning experiences, the position of 
aggression in the hierarchy of available re- 
sponses does seem to change in an observable 
manner. 

Other important behavioral changes take 
place from one age level to another. Gesell 
and Ilg have indicated that as a child grows 
older the “interior patterns of his private psy- 
chology” are changing as well (11, p. 12). 
Such changes have been a concern of psycho- 
analytically oriented play therapists who ad- 
vocate different treatment approachcs to chil- 
dren of different ages (16). 


Age in Nondirective Play Therapy 


Just as the concept of aggression has not 
been admitted into client-centered philosophy, 
so has the concept of personality changes with 
advancing age been lacking from formulations 
of the nondirective play therapy process. Per- 
haps the conceptualization of nondirective 
play therapy (1) excluded consideration of 
age differences because of the emphasis on 
process in nondirective therapy in play ther- 
apy: “Experiences with play therapy .. . 
add to the conviction that therapy is very 
definitely a process, a process which follows 
certain fundamental trends” (21, p. 170). 

Nondirective play therapy, based directly 
upon the techniques of nondirective counsel- 
ing, emphasizes the forces within each indi- 
vidual to find mature behavior more satisfy- 
ing than immature behavior. “This type of 
therapy starts where the individual is and 
bases the process on the present configuration 

’ (1, p. 15). Since the individual is ac- 
cepted where he is in client-centered therapy 
and the process begun, the child is accepted 
similarly where he is chronologically and the 
process of nondirective play therapy begun. 

There may be a recognition that person- 
ality changes from age to age in the ex- 
pression of the one basic nondirective need, 
namely, the striving for maturity. However, 
such an implied recognition of age differences 
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does not enter into the method of nondirec- 
tive play therapy. The process is the same for 
all children admitted to the playroom regard- 
less of age. 


Studies of the Process of Nondirective 
Play Therapy 


As has been shown, both age and aggres- 
sion have been found to be factors of keen 
importance in other theories of psychother- 
apy, especially where children were concerned. 
Yet neither factor has been considered gener- 
ally in studies of the process of nondirective 
play therapy. Such studies fall into two types, 
methodologically as well as chronologically 
distinguishable. 

Studies of the earlier type involved exami- 
nation of the process of nondirective play 
therapy as found in already available proto- 
cols. Later studies’ were objective investiga- 
tions of the protocols of selected children. 


Protocol-Centered Studies 


Landisberg and Snyder (13) examined the 
protocols of four cases to determine what 
took place in nondirective play therapy. They 
found similarities in the process from case to 
case. Although the four children ranged in 
age from five to six years, the categories used 
to classify the verbal remarks had been de- 
veloped for use with adults. Since the cate- 
gories did not originate from children’s ver- 
balizations, their representativeness may have 
been forced and unrealistic. Furthermore, the 
narrow age range prevented inferences re- 
garding the process of nondirective play ther- 
apy at different age levels. 

The Procrustean forcing of children’s state- 
ments into adult categories was obviated by 
Finke (4) who developed new categories 
based on an analysis of children’s statements 
made in nondirective play therapy. She used 
protocols from six children ranging in age 
from five to eleven years. It was found that 
the children, though in therapy with different 
therapists, showed similar trends in their ver- 
bal expressiveness. Her study did not consider 
the possibility that just as the wide age dis- 
crepancies between adults and children might 
influence their verbalizations, so might chil- 
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dren’s categories vary significantly from one 
age level of maturity to another. 

Seemingly the last study of this type was 
submitted for publication in 1952 (18). It 
was concerned with similarities in the pattern 
of emotional growth of normal and disturbed 
children. Extensive excerpts, arbitrarily se- 
lected and subjectively interpreted, revealed 
that “the emotional process and the changes 
in feeling tones evident as a child goes through 
play therapy ... occur... not step by 
step, but in individually varying sequences” 
(18, pp. 97-98). 

This study was nonobjective and resembled 
the suggestive investigations appearing in the 
persuasive or propagandistic period of re- 
search in nondirective play therapy (15). No- 
where was consideration given to age. 


Child-Centered Studies 


The first publication of the later type of 
investigation was apparently that of the pres- 
ent writer (14). Twenty children, aged 4, 6, 
8, 10, and 12 years, equated for intelligence 
and social adjustment, were used as subjects. 
Maturation, as represented by chronological 
age, was found.to account for different types 
of statements made in the nondirective play 
situation. 

Similar studies by Moustakas (19, 20) have 
appeared since. One of them (19) studied the 
frequency and intensity of the expression of 
negative attitudes of nine well-adjusted and 
nine disturbed young children matched in 
various characteristics. At the end of four 
play therapy sessions, both groups expressed 
about the same types of negative attitudes. 

Another example of subject-centered re- 
search concerned 10 four-year-old children 
with and without serious emotional problems 
(20). “Data obtained . . . showed more simi- 
lar than divergent behavior in the two groups 

.” (20, p. 155). The most significant dif- 
ference between the groups was that the mal- 
adjusted children were nonattentive. This dif- 
ference was explained by the maladjusted 
children’s noninteractive behavior, i.e., fan- 
tasy or activities that excluded the therapist. 
The degree of aggressiveness was not con- 
trolled. One wonders if the results would have 
been the same had aggression been considered. 
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The above mentioned studies represent all 
of the published work on the process of non- 
directive play therapy known to the present 
writer. While the studies are similar they are 
not directly comparable. In only four investi- 
gations does age seem to have been controlled. 
In three publications, age was regulated by 
restriction; only children within a narrow age 
range were studied: 5—6 years (13) and 4 
years (19, 20) respectively. Hence, only one 
study clearly took age differences into account 
(14). This study alone found definite changes 
in the types of statements made-by children 
in nondirective play therapy. 

Only two of the studies clearly controlled 
intelligence (14, 19). Hence, differences in 
intellectual ability, long recognized as being 
of fundamental importance in speech habits 
(vide the items of the Stanford-Binet Intelli- 
gence Test), have not been considered in 
many studies of the verbalizations made in 
nondirective play therapy. This lack would 
seem to be of sufficient magnitude to vitiate 
investigations that have failed to control for 
it. None of the studies considered the possi- 
bility of differences arising in verbalizations 
as related to aggressiveness or lack of ag- 
gressiveness in children. 

The empirical progress made in psycho- 
analysis and learning theory has been antici- 
pated by the construction of adequate theory. 
Further developments in nondirective play 
therapy would seem to require suitable the- 
ory. Without such a theoretical framework, 
nondirective play therapy may be in danger 
of becoming immeshed in the “regularity” of 
its process, until, like introspectionism, it be- 
comes process bound and unproductive. A 
good theory is said to be economical in per- 
mitting many observations to be subsumed 
under a single proposition. Theories concern- 
ing changes ascribable in aggression or age 
would seem to be both economical and mean- 
ingful. 


Summary 


Psychotherapeutic processes may vary con- 
siderably as a function of both the age of the 
subject and the dynamic processes character- 
izing his psychopathology. In childhood, the 
relationship between age and aggression is es- 
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pecially close. Nondirective play therapy does 
not formally consider the influence of aggres- 
sion or age in its method. As a result, studies 
of the process of nondirective play therapy 
generally indicate that all children undergo 
the same process. It would seem, however, 
that if age and aggression were evaluated 
properly, the course of nondirective play ther- 
apy would vary with the age of the child and 
with the character of the psychodynamic 
processes, €.g., aggression, associated with or 
defining his problem. Insistence on the same- 
ness of the process, and consequently of the 
method, may only result in the tautological 
death of productive work in nondirective play 
therapy. 

These assertions were supported by infer- 
ences drawn from the history of Freudian and 
learning theory and by data from studies of 
the process of nondirective play therapy. 


Received August 12, 1957. 
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Personality Patterns of Patients Before and After 
Application for Psychotherapy ' 


Norman S. Greenfield 


University of Wisconsin Medical School 


A previous study (1) utilized the MMPIs 
of 132 male college students who later at some 
time in their four-year college course sought 
psychotherapy at a student health psychiatric 
clinic, with the aim of testing the hypothesis 
that promptness of referral is positively re- 
lated to the degree of anxiety, the extent of 
internalization of problems, the amount of 
discomfort, and the severity of pathology. 
The results were surprising in that none of 
the variables proved to be significantly re- 
lated to the number of months which inter- 
vened between the time of taking the test and 
appearance at the clinic. Thus, in the search 
for precise motivational variables influencing 
utilization of psychotherapeutic services, it 
was impossible to verify even such simple 
“truths” which have so long received con- 
sensual validation. It was concluded that for 
the population studied, at least, these meas- 
ures were not indices of therapeutic readiness. 

Having demonstrated an inability to pre- 
dict which students would eventually become 
patients and which would not in a setting 
wherein psychotherapy was equally available 
to all students at no cost to the patient, the 
question was then raised as to what person- 
ality factors, or more precisely what changes 
in personality, influenced the decision to ap- 
ply for treatment. Again, logic and clinical 
experience led us to assume that, other fac- 
tors being equal, a person seeks psychother- 
apy at a time when he is feeling worse, or 


1An extended report of this study may be ob- 
tained without charge from Norman S. Greenfield, 
Department of Psychiatry, University Hospitals, 
Madison 6, Wisconsin, or for a fee from the Ameri- 
can Documentation Institute. Order Document No. 
5584, remitting $1.25 for microfilm or $1.25 for 
photocopies. 


suffering more than when he did not seek psy- 
chotherapy. 

The routine administration of the MMPI” 
to entering students made possible an exami- 
nation of self-controlled profiles of 31 stu- 
dents who were given a second MMPI at the 
clinic. The Ss were 20 males and 11 females 
whose mean age was 19 years. The mean 
number of months intervening between first 
and second test administrations was 10.97, 
with a range from 1 to 26 months. 

Clinically, the presenting complaints of the 
patients as well as their psychiatric diagnoses 
reflected the usual range of patient material 
encountered at the clinic. 

None of the differences between the scores 
of the two administrations approached signifi- 
cance. Both in pattern and amplitude pre- 
and posttests were almost identical. 

The voluminous body of literature utilizing 
the MMPI to demonstrate differences be- 
tween groups and, even more sensitively, 
changes in scores following psychotherapy 
suggests caution in cavalierly discarding these 
data and turning to the parsimonious chal- 
lenge of the validity of the technique. In any 
case, these results furnish some food for 
thought. 


Brief Report. 
Received January 30, 1958. 
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The Diagnostic Validity of the Ames “Danger Signals” 
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Illinois Institute for Juvenile Research 


The application of the Rorschach to very 
young subjects has resulted in an accumula- 
tion of studies indicating that children nor- 
mally give responses that are pathological in 
adults (10, 13). The publication of Child 
Rorschach Responses by Ames and her as- 
sociates (1) represented a milestone in that 
developmental norms were presented system- 
atically at yearly intervals for a carefully 
studied group of children ten years of age and 
under. Toward the end of this volume, Ames 
presented a list of “danger signals” which, 
even within the developmental context of a 
child’s Rorschach, might still be considered 
indicative of pathology. The present study 
was designed to test whether these “danger 
signals” had sufficient diagnostic validity to 
differentiate between a clinic and a normal 
population. As part of a study of normal 
eight-, nine-, and ten-year-olds undertaken by 
the Institute for Juvenile Research, 54 Chi- 
cago school children described as well-ad- 
justed by their teachers were given a com- 
plete diagnostic evaluation. For the purposes 
of this study, an effort was made to match 
each of these nonclinic cases with a child who 
had been referred to the Institute as being in 
need of child guidance services. The present 
report deals with a comparison of the Ror- 
schach records of 40 nondisturbed children 
with the records of their matched controls. 


Subjects 


The Ss were 40 normal children (27 boys 
and 13 girls) and their controls who were 
matched for sex, age, and IQ. Matching was 


1 The author would like to express her gratitude to 
Charlotte Altman who tested the entire nondisturbed 
sample; to Eugene Levitt who gave helpful sugges- 
tions; and to Irene Goldblatt, Jeanne Hershenson, 
and Victor Lebow who served as scorers and raters. 


in most cases within three months of age and 
five points of IQ. The matching for IQ was 
only approximate. All of the nondisturbed Ss 
were given Form L of the Binet by the same 
examiner, while their matched controls were 
given a variety of other tests, both group and 
individual. In all likelihood, the clinic group 
is slightly more intelligent than the nondis- 
turbed sample, since the clinic children were 
often matched on the basis of tests scores that 
are known to run lower than Binet ratings. 
This direction of a possible difference was 
considered safest so that there would be no 
question of the clinic children’s looking “‘sick”’ 
only because they were merely less intelli- 
gent. Actually, both the clinic and the non- 
disturbed children taken together constitute a 
highly homogeneous sample of children of su- 
perior intelligence. In this respect, both groups 
are comparable to the normative sample used 
by Ames (1, p. 25). 

Although no effort was made to match for 
socioeconomic status, the matching for IQ re- 
sulted in a clinic and a nondisturbed sample 
which were quite comparable in this regard. 
The socioeconomic level of the Ss was evalu- 
ated by means of the scale of paternal oc- 
cupations described by Goodenough (6, pp. 
501-512). Three clinical psychologists classi- 
fied the 80 Ss. In 39 of the 80 cases, all three 
judges agreed on the classification. In an ad- 
ditional 31, one judge differed by one cate- 
gory from the other two. The S was then as- 
signed to the category agreed upon by two 
out of three judges. In the nine cases with 
ratings spread over three categories, the S 
was assigned to the middle category. Both 
the clinic and the nondisturbed group are 
slightly below the Ames sample in regard to 
socioeconomic status: 93% of the Ames sam- 
ple (1, p. and 59% 


25) of the combined 
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clinic and normal groups of this study fall 
into Categories I through III on the scale of 
paternal occupations. The atypically high 
socioeconomic status of the Ames sample in 
comparison to the general population raises 
the possibility of bias in the Ames normative 
data. In fact, Fiedler and Stone (5, p. 279) 
found striking differences between their sam- 
ple of undisturbed children of low socioeco- 
nomic status and the Ames sample of high 
socioeconomic status. However, in this study, 
any possible effects of bias in the Ames norms 
are controlled by the matched-pairs design. 


Procedure 


All 40 Rorschachs of the nondisturbed chil- 
dren were administered and scored by a psy- 
chologist with 15 years’ experience and certi- 
fication by the American Board of Examin- 
ers in Psychology. The 40 Rorschachs of 
the matched controls were administered and 
scored by 15 psychologists of varying levels 


Table 1 


Descriptive Information About Subjects 
Used in This Study 








Nondisturbed Clinic 


Group 
N = 40 


Group 


Variable N = 40 





8-4—10-0 
108.38 mos. 
5.50 mos. 


8-3—9-11 
108.15 mos. 
5.40 mos. 


1Q 
Range 100-155 
Mean 120.30 
SD 14.06 


Socioeconomic status 


I. Professional 

II. Semiprofessional and 
managerial 

III. Clerical, skilled 
trades, retail business 

IV. Semiskilled, minor 
clerical 

V. Slightly skilled 

VI. Day laborers 
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of experience. Because of the extreme diffi- 
culty of obtaining enough clinic cases to 
match with a group of normals of such high 
intelligence, it was not possible to control for 
experience or identity of the examiner. Since 
arrangements could not be made to have all 
80 Rorschachs retyped without identifying 
information, this study was limited to objec- 
tive signs whose presence or absence could be 
evaluated merely by a check of the scoring 
sheet. 

Of the 13 “danger signals” listed by Ames 
and her associates, 4 were dropped because 
they involved subjective judgments. The re- 
maining 9 were raised to a total of 13 because 
of the necessity of treating the parts of four 
complex signs as separate signs (e.g., Sign 10: 
A% over 60 and no human responses). The 
13 signs used in this study are listed below 
with the numbering used by Ames (1, p. 283). 
Qualifications introduced by the author are 
given in parentheses. 

1. Preponderance of tree or leaf responses (Pre- 

ponderance defined as over 50% of R) 
F% exceeds 75 
More than one pure color response 
5A. More than one response involving differenti- 
ated shading 
. More than one response involving diffuse, 
dysphoric shading, i.e., Clob 
More than one inanimate movement response 
W% less than 40 
. Dd% over 15. Dd% equals Dd, S, and Do 
responses over total R 
More than one X response 
More than one S response 
A% over 60 

10B. No human response (ie. no H: 

counted) 

11. R less than 10 


Hd not 


The evaluation of the 13 signs was done by 
the author. A slight element of subjective 
judgment could not be avoided in that some 
rescoring had to be done before even the ob- 
jective Ames signs could be checked. Of the 
80 protocols used in the study, 70 were scored 
by the Beck system (2), while the Ames sys- 
tem is closer to the Klopfer method (9). For- 
tunately, it has been common practice at the 
Institute to make marginal notations of m 
and FM. However, the elimination of sepa- 
rate D scorings for details covered in a W 
and the division of shading scores into Clob 
(diffuse, dysphoric) and F(C) (differenti- 
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ated) also had to be done. The author pre- 
pared a detailed instruction sheet indicating 
the differences between the Ames and the 
usual clinic scoring in regard to those vari- 
ables that would be involved in an evaluation 
of the Ames danger signals. The scoring rec- 
ord for each of the 80 protocols was then 
transferred to a new scoring sheet, with each 
response being scored only for the relevant 
variables which are as follows: 


Area (W, D, Dd, Do response, S) 

Major determinant (M, m, FM, C, either FC or 
CF, differentiated shading, diffuse shading with 
dysphoric element, F) 


Content (H, A or Ad, tree or leaf) 


No differentiations other than those listed 
were made. For example, no effort was made 
to differentiate between FC and CF. The 
Ames signs were then checked by counting 
the various types of responses on these scor- 
ing sheets. Tabulation of the Ames signs was, 
of course, purely clerical once the scoring 
sheets had been made out. 

As for the possibility of the introduction of 
bias in the rescoring, the author can state 
only that there seems to be satisfactory evi- 
dence in the literature that mere scoring of 
Rorschach responses has sufficient interex- 
aminer reliability to warrant being regarded 
as an objective procedure (7, 8). However, 
as a reliability check, one-fourth of the proto- 
cols (10 normals and 10 clinic cases) were 
scored by a colleague as well as by thc au- 
thor. The percentage of agreement on the 
scoring of location varied from 71 to 93 on 
the 20 records, with a mean of 81 and an 
average deviation of 5.25 percentage points. 
The percentage of agreement on the scoring 
of major determinant varied from 65 to 100 
on the 20 records, with a mean of 84 and an 
average deviation of 7.05 percentage points. 
The percentage of agreement on the scoring 
of content varied from 77 to 100 on the 20 
records, with a mean of 91 and an average 
deviation of 6.10 percentage points. There 
was perfect agreement on content scoring for 
a third of the records. 


Results 
Total Scores 


The total scores on the Ames list of danger 
signals were compared for the clinic and the 


Table 2 
Total Scores on Ames “Danger Signals” and Davidson 
Adjustment Signs in Matched Pairs of Clinic 
and Nondisturbed Subjects 


Non 
disturbed Clinic 
Group 


Group 





Ames “Danger Signals” 


Mean 
SD 


Davidson Adjustment Signs 
Mean 
SD 
Prorated mean 


* Significant at .05 level 


nondisturbed groups by means of a variation 
of McNemar’s Formula 26 for the evaluation 
of the difference between two correlated means 
(12, p. 74). The results are presented in 
Table 2. The clinic group had significantly 
fewer danger or maladjustment signs than the 
nondisturbed group. Before investigating the 
direction of the discrimination done by each 
individual Ames sign, the author compared 
the clinic and the nondisturbed groups by 
means of 13 of the 17 Davidson adjustment 
signs (4). The two signs involving color shock 
and shading shock respectively were omitted 
because they involve subjective judgment. 
The two signs involving FM and F% respec- 
tively were omitted because the records were 
scored by the Beck system. The Davidson 
signs were evaluated on the basis of the scor- 
ing done by the original examiners without 
any rescoring by the author. The results of 
this comparison are also presented in Table 2. 
In contrast to the Ames signs, the Davidson 
signs show an absolute difference in the ex- 
pected direction; however, the difference is so 
small that there is a high probability that it 
could be a chance deviation from a true dif- 
ference of zero (p > 60). Although all sta- 
tistical tests were done on the basis of the 13 
signs actually counted for both the nondis- 
turbed and the clinic groups, the mean based 
on 13 signs was prorated to obtain the esti- 
mated mean had all 17 signs been used. The 
prorated means are also shown in Table 2. 
An estimate of the accuracy of the prorating 
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is provided by some additional data available 
for the nondisturbed sample only. The origi- 
nal examiner had evaluated the group on the 
basis of all 17 signs and obtained a mean of 
9.3. The prorated mean obtained by extrap- 
olating from the mean of 7.1 for 13 signs is 
also 9.3. The prorated means of 9.3 and 8.6 
for the nondisturbed and the clinic groups re- 
spectively are consistent with the results of 
previous studies which show slight differences 
between normal and disturbed groups with 
considerable overlapping of the distributions 
of scores. In the studies reviewed by David- 
son, means of 9.05 to 11.34 were reported for 
better-adjusted groups and means of 6.85 to 
8.17 for less well-adjusted groups (4). 


Individual Signs 

The Davidson signs (4) did not differenti- 
ate the nondisturbed from the clinic patients 
in the present study, and the Ames danger 
signals discriminated in the “wrong” direc- 
tion when evaluated in terms of total scores. 
Because of the possibility that individual 
Ames signs discriminated in different direc- 
tions, the author carried out a detailed analy- 
sis, sign by sign. The results of this analysis 
are presented in Table 3. 


Table 3 


Incidence of Ames “Danger Signals” in Matched Pairs 
of Clinic and Nondisturbed Subjects 





Nei- 


Only 
Both ther Non- 
of of Only dis- Chi 


Danger signal pair pair clinic turbed square 





1A Preponderance 
of tree and 
leaf R 
F% > 75 
Pure C > 1 
Differentiated 
shading > 1 
Diffuse, dysphoric 
shading > 1 
m>t 
W% < 40 
Dd% > 15 
X responses > 1 
>S>1 
A% > & 
NoH 
R< 10 


an 
adh nat al 
So FN NM 
Baan 


Or UN UF OF Ww 
coy 





* Significant at .05 level. 
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In the case of each sign, the number of 
instances of occurrence in the nondisturbed 
group and lack in the clinic cases were com- 
pared with the number of instances of occur- 
rence in the clinic group and lack in the non- 
disturbed cases by means of McNemar’s for- 
mula for chi square for dependent samples 
(12, pp. 204-206). The use of McNemar’s 
formula permits one to evaluate only those 
two cells of the fourfold table which repre- 
sent the difference critical for one’s hypothe- 
sis. In this case, only the third and fourth 
columns of Table 3 were compared. The Yates 
correction for continuity was used in all com- 
putations. 

The data in Table 3 suggest first of all that 
most of the Ames danger signals are not use- 
ful for differentiating between clinic and non- 
disturbed samples of eight- to ten-year-olds 
because nearly half of the signs are either 
characteristically present in a sizable propor- 
tion of both normal and clinic cases or else 
are almost never found in the records of 
either. Thus in 42% of the matched pairs, 
both the clinic and the nondisturbed mem- 
ber have a W% below 40. In 100% of the 
matched pairs, neither member has a pre- 
ponderance of tree and leaf responses. In 90% 
of the matched pairs, neither member has 
more than one response involving diffuse, dys- 
phoric shading. The only “danger sign’’ which 
has differentiating power that reaches statisti- 
cal significance is Sign 10A (A% over 60). 
This sign discriminates in an unexpected di- 
rection: Of 22 matched pairs whose members 
differ on this sign, three times as many non- 
disturbed as clinic cases have the high A%. 
The most promising signs which discriminate 
in the expected direction are Signs 3 (pure 
C > 1), 5B (more than one instance of dif- 
fuse, dysphoric shading) and 6 (m>1). 
These signs, like contamination for schizo- 
phrenia, are apparently diagnostic when they 
occur, but have such limited incidence as to 
be of little use for routine screening purposes. 


Determinants Involved in the 


Ames Danger Signals 


Ranges on 


The Ames signs, in contrast to the David- 
son signs, make less allowance for ranges of 
scores with an optimum somewhere in the 
middle. Therefore, at the risk of inflating 
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Table 4 


Comparison of Matched Pairs of Clinic and Nondis 
turbed Subjects in Regard to Scores Below the 
25th Percentile of the Ames Norms 


Nei- 
Determinantand Both ther 
level necessary of of Only dis- 
to fall below pair pair clinic turbed Chi 
25th percentile below below below below square 


Only 
non- 





F% below 52 4 20 8 x“ 0 
No pure C 31 0 4 0 
Differentiated 
shading zero 10 
Diffuse, dysphoric 
shading zero 
m of zero 
W% < 28 11 
Dd% zero 1 
A% < 40 3 
R< 13 0 


1.07 


10.23** 
5.06* 





* Significant at .05 level. 
** Significant at .01 level. 


probabilities, one further analysis of the data 
was performed. The matched pairs of clinic 
and nondisturbed cases were compared in re- 
gard to the determinants underlying the dan- 
ger signals in two ways: (a) frequency at or 
below the 25th percentile * of the Ames norms 
and (6) frequency above the 75th percentile 
of the Ames norms (1, pp. 54-56). These 
findings are presented in Tables 4 and 5. 
Table 4 indicates that in the case of those 
matched pairs whose members differ in re- 
gard to being below the 25th Ames percentile, 
there are significantly more pairs whose non- 
disturbed member is below the 25th percentile 
for number of m and for number of responses 
involving diffuse, dysphoric shading. (As be- 
fore, only Columns 3 and 4 were compared by 
means of McNemar’s formula for chi square 
for dependent samples.) Table 5 indicates 
that in the case of those matched pairs whose 
members differ in regard to being above the 
75th Ames percentile, there are significantly 
more pairs whose nondisturbed member is 
above the 75th percentile for A% and signifi- 
cantly more pairs whose clinic member is 


2In the case of those signs with zero as the score 
for the 25th percentile, comparisons had to be made 
in regard to comparative frequencies at the 25th per- 
centile since, of course, there could be no frequencies 
below zero. 
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above the 75th percentile for number of re- 
sponses involving diffuse, dysphoric shading. 

A comparison of Tables 4 and 5 with 
Table 3 lends no support to the author’s sus- 
picion that the Ames danger signals would 
be improved by the introduction of middle 
ranges of scores on various determinants. 
Three of the four significant differences in 
the latter two tables result from a change in 
the cutting score for variables which appear 
to have a linear relationship to good adjust- 
ment. Thus, a complete lack of m and of dys- 
phoric shading, rather than the presence of 
a moderate amount, differentiates the clinic 
from the nondisturbed sample. In the case of 
the fourth significant difference on A%, the 
data do not reveal whether there is any point 
at which the relationship between health and 
A% ceases to be linear in regard to the groups 
used in this study. The highest A% of 93 was 
found in the nondisturbed group, in addition 
to the fact that more of the undisturbed group 
exceeded the Ames 75th percentile (A% of 
58). 


Discussion of Results 


The Ames danger signals offer no ad- 
vantages over the other sign lists developed 
without special attention to the younger age 
groups and the developmental sequences par- 
ticular to them. Neither are the Ames danger 


Table 5 
Comparison of Matched Pairs of Clinic and Nondis- 
turbed Subjects in Regard to Scores Above 
the 75th Percentile of the Ames Norms 


Nei Only 

Determinantand Both ther non 

level necessary of of Only dis 
to exceed 75th pair pair clinic turbed Chi 


percentile above above above above square 





F% above 78 x 9 2.08 
Pure C > 1 38 0 50 
Differentiated 

shading over 1 : 10 56 
Diffuse, dysphoric 

shading > zero 
m> ti 
W% > 52 
Dd% > 16 
A% > 58 
R > 23 





** Significant at .01 level. 
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signals any worse for screening purposes. The 
fact that several of the Ames signs discrimi- 
nate in a direction opposite to that expected 
has been found to be true of comparable sign 
lists such as the Davidson adjustment signs. 
Brockway, Gleser, and Ulett (3), for exam- 
ple, found that 8 of the 17 Davidson signs 
discriminated in a reverse direction when used 
on a population of adult males. 

One of the studies most comparable to the 
one described in this paper was carried out 
at the same clinic by another investigator. 
Levitt (11) compared 39 clinic cases, includ- 
ing 4 used by the present author, with the 
normative sample of Thetford, Molish, and 
Beck in regard to alleged Rorschach anxiety 
indices. He found that only Sum Y discrimi- 
nated in the expected direction at better 
than the .05 level of confidence. The discrimi- 
nating power of Y was improved by separat- 
ing the differentiated shading responses (FY) 
from those with less of a form element (YF). 
Similarly, in the study by the present author, 
the best discrimination was obtained by the 
use of Clob responses only, the latter being 
roughly comparable to the YF responses of 
Levitt’s study. However, the sum of all shad- 
ing responses had some discriminating power. 
When the 40 matched pairs of clinic and non- 
disturbed subjects were compared in regard 
to total number of shading responses, signifi- 
cantly more pairs were found with the normal 
subject at or below the Ames 25th percentile * 
of zero shading responses (chi square 5.26 and 
p < .05). When a similar comparison was 
made in regard to scores above the Ames 75th 
percentile, there was a statistically insignifi- 
cant trend for more clinic than normal mem- 
bers of matched pairs to be above the Ames 
75th percentile for total number of shading 
responses. That is, complete lack of shad- 
ing responses is significantly associated with 
membership in the nondisturbed group, while 
having two or more shading responses is less 


8 The Ames percentile for total number of shading 
responses was estimated by adding the number of 
differentiated and diffuse shading responses at the 
given percentile. Thus the 25th percentile for all 
shading responses was estimated as zero, since zero 
is the 25th percentile for both F(C) and Clob. The 
75th percentile was estimated as 1 since the 75th 
percentile for F(C) is zero and for Clob is 1. 
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discriminating. Once shading responses are 
present, it is necessary to know the type be- 
fore meaningful discriminations can be made 
between groups. 

In general, those signs and determinants 
which discriminate in any way between the 
clinic and the disturbed group seem to have 
to do with tension (m, shading responses) 
and the stereotypy-conformity dimension of 
behavior (A%). Both the clinic and the non- 
disturbed samples are children of latency age. 
The nondisturbed sample appears to be dis- 
tinguished mainly by a consolidation of de- 
fenses. Apparently, deviation in the direction 
of more control than the normative Ames 
sample is not pathological at this age, whereas 
deviation in the direction of less control 
identifies the patient population. Although in 
the adult, magnitude of deviation may be of 
major importance, it is possible that in the 
child the direction of the deviation in terms 
of the child’s developmental level is equally 
important. The nondisturbed group in this 
study is exaggeratedly like the conforming, 
controlled model child that is the societal 
norm for the latency period. The fact that the 
nondisturbed sample was chosen by school 
teachers undoubtedly increased the prob- 
ability that a particular variety of the so- 
cietal ideal of normal latency adjustment 
would constitute the sample. Perhaps in the 
expansive period of adolescence the deviations 
in the direction of overcontrol are the more 
pathognomonic than similar quantitative de- 
viations in the direction of impulsivity. 


Summary 


Forty children described as well-adjusted 
by their teachers were matched for sex, age, 
and IQ with children referred for child guid- 
ance services to a large community clinic. 
Each group contained 13 girls and 27 boys. 
The average age of both groups was a little 
over nine years, and the average IQ of both 
groups was a little over 120. The matched 
pairs were compared by means of the Ror- 
schach in the following ways: (a) total num- 
ber of objectively scored Ames danger signals 
out of a list of 13, (5) total number of ob- 
jectively scored Davidson adjustment signs 
out of a list of 13, (c) presence or absence of 
individual Ames danger signals, (d) compara- 





Diagnostic Validity of Ames “Danger Signals” 


tive frequencies above the Ames 75th per- 
centile on determinants underlying the Ames 
danger signals, and (e) comparative frequen- 
cies below the Ames 25th percentile for deter- 
minants underlying the Ames danger signals. 

The nondisturbed group was found to have 
a significantly higher total of Ames danger 
signals. The difference between the two groups 
on the Davidson adjustment signs did not 
reach statistical significance, but was in the 
expected direction. When the individual Ames 
signs were evaluated separately, only A% 
above 60 was found to discriminate at a sta- 
tistically significant level. The difference was 
in the reverse of the expected direction, i.e., 
the nondisturbed group had more members 
with the high A%. When comparative fre- 
quencies above and below the Ames 75th and 
25th percentiles respectively were evaluated, 
only the following differences between the 
clinic and the nondisturbed group were found 
to be statistically significant (p < .05): fewer 
clinic children with no m, fewer clinic chil- 
dren with A% over 58, more clinic children 
with responses involving diffuse, dysphoric 
shading, and more clinic children having two 
or more of the latter type of response. 

The author concluded that the Ames dan- 
ger signals do not have sufficient diagnostic 
validity to make them useful for differentiat- 
ing between maladjusted and nondisturbed 
samples of eight- to ten-year-olds. Nearly half 
of the signs are either characteristically pres- 
ent in a sizable proportion of both normal and 
clinic cases or else are almost never found in 
the records of either. Several of the “danger” 
signs actually tend to be found more fre- 
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quently in normal records. However, the de- 
velopmental frame of reference underlying the 
signs was still seen as useful in interpreting 
the findings of the study. 


Received July 22, 1957. 
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An Investigation of Determinants Inducing Individuals 
to Seek Personal Counseling *” 


James S. Terwilliger and Fred E. Fiedler 


University of Illinois 


This study attempted to identify some of 
the determinants which lead individuals to 
seek therapeutic help. Very few situations per- 
mit an adequate study of this problem. Gen- 
erally the counselor only sees people who 
feel the need for psychotherapy, while many 
others, equally maladjusted by various cri- 
teria, choose, for various reasons, not to con- 
sult a therapist. It is, therefore, of consider- 
able theoretical and practical importance to 
determine how those who seek counseling 
differ in important respects from those who 
do not. 

Such a study was possible at the Univer- 
sity of Illinois where the Student Counseling 
Service deals with students’ educational, vo- 
cational, and personality problems. Students 
are urged to consult counselors if they, at 
any time, feel the néed for professional help. 
Counseling services are provided without 
charge. 

Sample. The subjects (Ss) were male Uni- 
versity freshmen who lived in temporary bar- 
racks which normally housed 13 to 14 men. 
Ss are assigned to the barracks in the order 
in which their application is received. 

On the basis of counseling service records, 
we identified a “counseling bureau group” 
(CBG) consisting of Ss who had visited the 
Counseling Bureau two or more times for per- 


1 This study was conducted under Contract DA- 
49-007-MD-569 between the Office of the Surgeon 
General, Department of the Army, and the Univer- 
sity of Illinois. 

2An extended report of this study may be ob- 
tained without charge from Fred E. Fiedler, 427 
Gregory Hall, University of Illinois, Urbana, IIli- 
nois, or for a fee from the American Documenta- 
tion Institute. Order Document No. 5582, remitting 
$1.25 for microfilm or $1.25 for photocopies. 


sonality problems (NV = 10). Those who had 
never consulted the bureau were placed into 
the control group (N = 87). 

Tests. A number of personality and adjust- 
ment measures were obtained from each S. 
These included an abbreviated Taylor Mani- 
fest Anxiety Scale, a self-esteem score, a self- 
ideal self discrepancy (or self-satisfaction) 
score. Also obtained were sociometric status 
indices to provide a measure of the individu- 
al’s acceptability to his peers, and “esteem 
by others” scores from S’s fellow students 
who lived in the same barracks. This latter 
score indicates S’s adjustment as evaluated 
by others. In addition, we also computed a 
Grade Point Average, adjusted for S’s per- 
formance in high school. 

Results. Taylor anxiety scores, self-satisfac- 
tion, and self-esteem scores of Ss who con- 
sulted therapists differed significantly in the 
maladjusted direction from those who did not 
consult a therapist. However, differences in 
other scores were not significant. 

Discussion. As is to be expected, anxious 
and less self-satisfied individuals are more 
likely to seek therapeutic help than are those 
who do not express such feelings. In contrast, 
however, these Ss do not differ from the con- 
trol group on objective or outside indices. Ss 
who consulted therapists were as well liked as 
others, they were esteemed as highly, and they 
tended to have slightly better grades. These 
findings suggest that the client is brought into 
therapy solely by subjective feelings of dis- 
comfort with himself rather than by “re- 
ality” factors. 


Brief Report. 
Received February 20, 1958. 
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Vocabulary as a Basis for Estimating Intelligence from 
the Rorschach’ 


Thomas R. Trier 


University of California, Berkeley 


An estimate of intelligence is usually in- 
cluded in the formulations made from the 
Rorschach. It can be particularly helpful 
when time limitations or other considerations 
render it impractical or impossible to adminis- 
ter an intelligence test. Beck (2) and Klopfer 
and Kelley (3) are among those who have 
tried to make more explicit the variables 
which should be considered in estimating in- 
tellectual level. Beck states that intelligence is 
related to Z, A%, F+%, Ap, and number 
and quality of M responses. Klopfer and 
Kelley suggest that an estimate of intelligence 
should be based on number and quality of 
W, M, and O responses, form accuracy level, 
variety of content, and succession of responses. 
Research seeking to determine whether the 
postulated relationships between Rorschach 
variables and IQ do in fact exist has been 
rather unsuccessful. Usually, when low, sig- 
nificant correlations between Rorschach fac- 
tors and IQ have been found, they have failed 
to hold up under cross-validation. The results 
reported by Armitage et al. (1) are typical 
in this respect, but are particularly worthy of 
mention because of the large number of cases 
employed. The authors computed the corre- 
lations between 19 Rorschach variables and 
Wechsler-Bellevue IQ for 503 patients. Low 
correlations, significant at the .01 level, were 
obtained for 13 of the variables. A multiple 
regression equation, based on the six variables 
showing the highest correlation with IQ, was 


1 This research was performed while the writer 
was a psychology trainee at the Veterans Adminis- 
tration Hospital, San Francisco, California. He wishes 
to express his appreciation to David A. Rodgers for 
his valuable assistance in the design and analysis of 
the experiment and in the preparation of the article. 


computed. When this equation was applied to 
a new sample of 207 patients, it failed to 
improve the accuracy of estimates beyond 
chance. In contrast, judges were able to make 
fairly accurate estimates of intelligence when 
exposed to the total protocols. The authors 
suggest the possibility that the judges were 
responding to the quality of the vocabulary 
and the quality of the perceptual organization 
in making their judgments. 

The present study is an attempt to deter- 
mine whether the vocabulary in Rorschach 
protocols is a variable that can be profitably 
utilized in estimating intelligence. Wechsler’s 
statement, “. . . the size of a man’s vocabu- 
lary . . . is an excellent measure of his gen- 
eral intelligence” (6, p. 98), has been gen- 
erally accepted. Vocabulary subtests are in- 
cluded in many intelligence tests. In both the 
Stanford-Binet (4) and Wechsler-Bellevue 
(6), the vocabulary subtest correlates higher 
with total IQ than do any of the other sub- 
tests. If the extensiveness of a person’s vo- 
cabulary can be judged from the Rorschach 
with some accuracy, this measure might be 
useful in estimating his intelligence. 


Procedure 


Three groups, each consisting of four judges, 
were asked to make numerical estimates of 
Wechsler-Bellevue IQ. Judges in Group I were 
given 16 Rorschach protocols. They were 
asked to list the seven “most sophisticated” 
words from each protocol and to base their 
IQ estimates on these word lists. Judges in 
Group II were each given one of the four 
sheets of word lists constructed by the judges 
of Group I. They were informed how the lists 
were derived and were asked to make IQ esti- 
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mates using only the word lists. They differed 
from Group I in having no access to the total 
protocol. Group III judges were handed the 
sixteen protocols and asked to estimate intel- 
ligence from them. In contrast to the others, 
Group III judges were not given specific in- 
structions on what to base their judgments. 

In addition, an attempt was made to meas- 
ure the extensiveness of vocabulary in a more 
objective fashion. The four word lists con- 
structed by Group I judges for each protocol 
were pooled. While these word lists over- 
lapped considerably, pooling revealed that 
from 10 to 20 different words had been se- 
lected as “most sophisticated” for each proto- 
col by the four judges, with a mean of 14.7. 
These words were looked up in Thorndike 
and Lorge’s The Teacher's Word Book of 
30,000 Words (5) to determine how fre- 
quently each word appears in everyday usage. 
A number that indicated how often each word 
appears among a million written words was 
obtained. The seven most infrequently ap- 
pearing words were then selected for each 
protocol and the frequencies of these seven 
words were totalled. Sixteen sums were thus 
obtained, one for each protocol. Rank-order 
estimates were made using the hypothesis that 
there would be an inverse relationship be- 
tween the size of the sum and IQ. 

The criterion measures of intelligence were 
based on full scale IQs on the Wechsler- 
Bellevue scale. Four Rorschach records were 
obtained from patients in each of the follow- 
ing IQ ranges: 120 and above, 110-119, 90— 
109, and 89 and below. Thus IQs were roughly 
rectangularly distributed, permitting compari- 
sons of the accuracy of judgments based on 
the total protocols (Group III) with the re- 
sults reported by Armitage et al. (1), who 
used a similar distribution. The protocols of 
male veterans who were referred for psycho- 
logical evaluation during hospitalization were 
selected from the files at San Francisco Vet- 
erans Administration Hospital. The tests had 
been administered by eight different examin- 
ers. Patients suspected of having organic 
brain pathology were excluded from the sam- 
ple. Only those cases whose Rorschach proto- 
cols were scored by the Klopfer and Kelley 
system (3) were used. The age of the patients 
in the sample ranged from 19 to 67, with a 
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Table 1 


Rho Correlations Between the IQ Estimates of Judges 
and Wechsler-Bellevue 1Q 


Group ITI 


Group II 





Group I 


Judge 





P vag 
.76** 
“a 


68** 


85** 
81** 
.66** 
.63** 


Pag 
67** 
6A** 


* Significant at or below the .05 level. 
** Significant at or below the .01 level. 


mean of 38.4. The mean number of years of 
education was approximately ten. From a 
diagnostic standpoint, the group was hetero- 
geneous and included psychoneurotic, psy- 
chotic, personality disorder, and psychoso- 
matic cases. 

The 12 clinicians used as judges all had ex- 
tensive training and experience in the use of 
the Rorschach and Wechsler-Bellevue. Most 
of them were graduate students nearing the 
completion of their Ph.D.s in clinical psy- 
chology. 


Results 


Correlation coefficients were used as indi- 
cators of the accuracy of judgments. In view 
of the rectangularity of the distribution, rank- 
order coefficients were employed. Table 1 in- 
dicates the correlations and the corresponding 
significance levels which were obtained by the 
judges. Results for Groups I and II indicate 
that estimates of intelligence can be made 
from Rorschach vocabulary. Since the ac- 
curacy of judges in Groups I and II is ap- 
proximately the same, it does not appear to 
make any difference whether judges making 
their estimates from word lists have access to 
the protocols or not. It is to be noted that the 
judgments based on total protocol are no 
more accurate than those based solely on the 
vocabulary. In addition, the estimates based 
on the Thorndike and Lorge word frequency 
count resulted in a rho of .77 (P< .01), 
which is roughly comparable to the accuracy 
achieved by the judges. The results for Group 
III are in agreement with those of Armitage 
et al. (1), who report a median correlation of 
.69 for estimates based on evaluation of the 
total protocols. 





Vocabulary and Estimating Intelligence from the Rorschach 


Discussion 


In comparison to the conventional Ror- 
schach scoring variables whose contributions 
toward estimating intelligence have been quite 
limited, vocabulary is a variable that im- 
proves accuracy of judgments considerably. 
To rely solely on vocabulary in estimating 
IQ would simplify the task and might well 
be the most practical approach at present. 
The results suggest that the clinician is not 
indispensable for making the estimates. The 
Rorschachs could be submitted to a clerk who 
is instructed to select the rarest words from 
each protocol and to look them up in a word 
frequency table. After finding the seven most 
uncommon words in each, he totals their fre- 
quencies and enters this total into an em- 
pirically derived equation or table, which 
yields a numerical estimate of IQ. 

The present investigation points the way 
toward areas of possible further research: 

1. It is quite likely that vocabulary could 
also be evaluated from samples of verbal be- 
havior other than the Rorschach. Thus esti- 
mates of intelligence might be made from the 
Thematic Apperception Test or from inter- 
views, possibilities first suggested by Armitage 
et al. (1). 

2. Since the Thorndike and Lorge word 
list is somewhat outdated and contains only 
30,000 words, it is to be expected that if a 
more current and extensive word list could be 
utilized, it would lead to somewhat more ac- 
curate estimates of IQ than those reported 
here. 

3. Since judges in Group III were not 
limited to specific cues in making their esti- 
mates of IQ, it is impossible to determine to 
what extent their accuracy of judgment was 


291 


due to reliance on vocabulary. It would be 
informative to determine whether comparable 
accuracy would be achieved when vocabulary 
is eliminated as a cue. This could be accom- 
plished by translating the protocols into basic 
English before giving them to the judges. 


Summary 


Three groups, each consisting of four judges, 
made estimates of Wechsler-Bellevue IQ from 
16 Rorschach protocols. One group used word 
lists which they constructed from the proto- 
cols, another used word lists derived by others 
from the protocols, and the third used the 
protocols without instructions to focus on vo- 
cabulary. In addition, estimates were made 
in a more objective fashion using the Thorn- 
dike-Lorge word frequency count. The three 
groups and the objective approach achieved 
fairly comparable accuracy. Results suggest 
that IQ can be estimated as well, using only 
vocabulary, as by any other approach cur- 
rently in use. 


Received August 15, 1957. 
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Extraversion, Neuroticism, Radicalism, 
and Tendermindedness'’ 


A. W. Bendig 


University of Pittsburgh 


Eysenck (2) has stated that the person- 
ality trait of Extraversion is negatively re- 
lated to the social attitude dimension of Ten- 
dermindedness, but is not related to the atti- 
tude trait of Radicalism. Neuroticism is not 
related to either attitude dimension. However, 
Cattell states that Extraversion is negatively 
related to Radicalism (1, pp. 317-318) and 
also implies a relationship between Neuroti- 
cism and Radicalism (1, p. 209), but the di- 
rection of the correlation is left unclear. The 
present research was designed to clarify some 
of the interrelationships among these four 
variables. 

The Maudsley Personality Inventory, con- 
taining scales of Extraversion and Neuroti- 
cism, was administered to 208 undergraduate 
Ss (117 men and 91 women) in five sections 
of introductory psychology. The scales of 
Radicalism and Tendermindedness developed 
by Melvin (2, pp. 276-280) were also ad- 
ministered to the same Ss. Comparisons of 
the mean raw scores of the two sex groups 
showed the female Ss to be more “tender- 
minded” than the male Ss, but the differences 
on the other three scales were not statisti- 
cally significant. 

Product-moment correlation coefficients 
among the four scales were separately com- 
puted within each sex group and the pairs of 
correlations tested for significant sex differ- 
ences by the r-to-z method. The correlations 


1An extended report of this study may be ob- 
tained without charge from A. W. Bendig, Dept. of 
Psychology, University of Pittsburgh, Pittsburgh 13, 
Pa., or for a fee from the American Documentation 
Institute. Order Document No. 5578, remitting $1.25 
for microfilm or $1.25 for photocopies. 


were averaged when these tests did not ap- 
proach significance. The average correlation 
between Extraversion and Neuroticism (r = 
— .12) was significant at the .10 level, but 
the correlations of Extraversion with Radi- 
calism and Tendermindedness were not sig- 
nificant (r = — .04 and — .01). The average 
correlation between Neuroticism and Radical- 
ism (r = .22) was significant at the .01 level 
and similar in both sex groups (r = .18 and 
.26). The correlations of Tendermindedness 
with Neuroticism and Radicalism were signifi- 
cant for the male Ss (r = — .28 and — .19), 
but not for the female Ss (r = — .03 and .08), 
and the sex differences in these correlations 
were significant at the .10 level (z = 1.79 and 
1.95). 

These results indicate that Eysenck’s link- 
ing of Extraversion with Tendermindedness 
and Cattell’s association of Extraversion and 
Radicalism were not confirmed by the data. 
However, Cattell’s hypothesized relation of 
Neuroticism and Radicalism did appear 
within both sex groups of Ss. The sex dif- 
ferences among the correlations of Tender- 
mindedness with Radicalism and Neuroticism 
may represent either a sampling fluctuation 
or a different organization of these three 
traits in men and women Ss. 


Brief Report. 
Received March 7, 1958. 
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of Human Figure Drawings’ 
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Projective theorists have suggested inter- 
pretations for many aspects of human figure 
drawings, among them the position of the 
drawing upon the page. Thus, Karen Mach- 
over (2) proposed that the location of the 
drawing toward the right side of the page in- 
dicates that the subject is environment-ori- 
ented, while a left-side location is character- 
istic of a self-oriented person. A drawing high 
on the page is related to optimism, a low- 
positioned drawing suggests depression. Levy 
(1) proposed that drawings on the left of the 
page indicate self-consciousness or introver- 
sion, toward the right, rebelliousness. For 
Levy. the use of the upper half of the page 
signifies striving toward high standards of 
achievement whereas persons who place draw- 
ings toward the lower edge of the sheet are 
likely to be stable, firmly rooted, or, in some 
cases, depressed. That these interpretations 
are based upon resemblances in meaning be- 
tween words used to describe space (high, 
low, etc.) and words used to describe person- 
ality (elated, depressed, etc.) seems obvious. 
Whether such linguistic resemblances provide 
a valid basis for hypothesizing relationships 
between drawings and personality has not as 
yet been adequately determined. 


1 We wish to express our appreciation to the Leba- 
nese, Egyptian, and American schools who provided 
data for this study. The Egyptian and Lebanese data 
were gathered in 1956-1957 when the author was a 
visiting professor at the American University of 
Beirut, and the gathering and analysis of these data 
were supported by a grant from the Rockefeller 
Brothers Fund. Special thanks are due to Habib 
Kurani and E. Terry Prothro of the American Uni- 
versity of Beirut for their cooperation and encour- 
agement and to Evelyn Raskin of Brooklyn College 
for consultation and suggestions. 


Whatever may be the final decision in re- 
gard to the relationships between personality 
and graphic expression, it seems likely that 
not all of the variance present in drawings 
will be accounted for by individual differences 
in temperament and personality. There may 
be determinants of drawing position which 
are cultural and which have little if any rela- 
tionship to personality. Culture determines the 
position on a page at which a person begins 
to write. If there is a habit-transfer between 
writing and drawing, the position of draw- 
ings will be affected by handwriting habits. 
This study examines the drawing location of 
two classes of writers; namely, those who be- 
gin to write at the top left of the page and 
those who begin to write at the top right of 
the page. 


Method 
Subjects 


The writers who start at the top left of the 
page are represented by two subgroups, those 
writing English and those writing Armenian. 
In nationality, these Ss are Americans and 
Lebanese, respectively. Those starting at the 
top right of the page are Arabic writers. Their 
nationalities are Lebanese and Egyptian. All 
are children in elementary schools between 
ages five and ten. 

The American Ss come from three widely. 
separated localities: (a) a small town in 
Maryland, (4) a small town in Kansas, and 
(c) a suburban town near New York City. 
The Armenian Ss attended two schools in 
Beirut, Lebanon. The Ss comprised either 
complete samples or random samples from 
their respective schools. The total number of 
Ss in the above groups was 1,075. 
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Some of the Arabic-writing Ss were attend- 
ing a public school in Port Said, Egypt. All 
others were attending public or private schools 
in Lebanon. These two groups totalled 651 
Ss. These Ss, like the Armenian and Ameri- 
can groups were, so far as could be deter- 
mined, representative of their schools. 


Testing and Scoring Procedures 


The drawing test employed was the Good- 
enough draw-a-man test. Goodenough’s in- 
structions were followed. All drawings were 
made in pencil on 83” by 11” white paper. 

In scoring each drawing for the purposes of 
the present study, it was indicated, first, 
whether the topmost part was nearer or the 
bottom-most part was nearer their respective 
edges of the paper. Second, it was noted 
whether the left or right side of the drawing 
(left and right being designated from the ob- 
server’s point of view) was nearer the lateral 
edges of the page. In judging the vertical lo- 
cation, the most extreme part of the drawing, 
such as the top of the hat or a projecting 
feather, was used. However, right and left 
judgments were based upon the most lateral 
parts of the body, excluding the extremities. 
In those drawings which lacked a body, judg- 
ments were based upon the head. When there 
was any doubt as to the judgments, measure- 
ments were made. When two measurements 
were identical, which seldom occurred, scores 
were assigned alternately to the two cate- 
gories. 


Results 
In terms of the scoring method just de- 
scribed, a drawing must receive one of four 


Table 1 


Position of Figure Drawings on a Page 








Armenian- 


Location American 


Arab 





Top left 

Top right 
Bottom left 
Bottom right 


48% 
16% 


24% 
487% 
27% 14% 
9% 14% 


N 1,075 651 





Wayne Dennis 


scores: top left, top right, bottom left and 
bottom right. Top left means that the draw- 
ing was nearer the top of the paper than to 
the bottom, and nearer the left edge than the 
right edge. The other terms have correspond- 
ing meanings. 

Table 1 shows the percentage of drawings 
falling into each category for each of the 
groups. 

Table 1 shows that in the Armenian-Ameri- 
can group, 48% of the drawings have a top- 
left position, whereas in the Arab group, 48% 
fall in the top-right group. 


Discussion 


Obviously, these findings are in accord with 
the habit-transfer hypothesis suggested ear- 
lier. Also in accord with this interpretation is 
the finding that 64% of the Armenian and 
American drawings and 72% of the Arab 
drawings are nearer the top of the page than 
they are to the bottom. 

However, while all Armenians and Ameri- 
cans begin to write in the top-left portion of 
the page and all Arabic-writing Ss start their 
handwriting in the top-right quadrant, not all 
members of these groups place their drawings 
in the corresponding positions. Handwriting 
customs appear to have some influence in 
producing differences between certain groups, 
but they cannot, of course, explain the within- 
group variability. While the data here pre- 
sented are consonant with the handwriting 
hypothesis, they are not entirely conclusive. 
The two groups compared no doubt differ in 
other respects as well as in regard to hand- 
writing. It is unlikely that one can find two 
groups differing in handwriting but identical 
in all other respects. However, if one were to 
find that many groups differing in handwrit- 
ing also differ in a consistent way in location 
of drawings, our hypothesis would seem to be 
firmly supported. Until a wider sampling of 
groups is obtained, the data here presented 
should serve as a caution in regard to inter- 
preting cross-cultural differences in drawing 
locations as indicative of differences in per- 
sonality. Such differences may be the result 
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of habit patterns having little relationship to 
emotional or social traits. 


Summary 


Evidence has been presented which is con- 
sonant with the hypothesis that the location 
of drawings depends in part upon whether 
subjects in writing their primary language be- 
gin at the top left or top right of the page. 
Further comparisons of other language groups 
are needed to test the hypothesis, but, mean- 
while, caution in the interpretation of cross- 
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cultural differences in location of drawings is 
indicated. 


Received July 23, 1957. 
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The Relationship Between Edwards’ and McClelland’s 
Measures of Achievement Motivation’ 


Levon H. Melikian 


American University of Beirut 


Independent measures of achievement mo- 
tivation have recently been developed by Ed- 
wards (1) and McClelland e¢ al. (2). Ed- 
wards’ measure of this variable is one of fif- 
teen that have their origin in, and bear the 
same names as, a list of manifest needs pre- 
sented by Murray (3). They form the Ed- 
wards Personal Preference Schedule which 
measures a “number of relatively independent 
normal personality variables” (1, p. 1). Mc- 
Clelland’s method of measuring n Achieve- 
ment is also rooted in Murray, since he uses 
imaginative stories evoked by two TAT cards 
and two other pictures designed by him and 
his associates. He has developed a quantita- 
tive method of scoring the imaginative mate- 
rial in terms of n Achievement categories (2, 
pp. 107-138). 

Since both Edwards and McClelland inde- 
pendently measure the same variable, a high 
correlation is expected between the scores ob- 
tained on their two measures. The purpose of 
this study is to test whether such a relation 
exists or not. 


Method 


The subjects (Ss) in this experiment were 
84 Arabic-speaking students in two psychol- 
ogy classes at the American University of 
Beirut. Approximately half of the Ss were 
citizens of Lebanon, and the other half came 
from Jordan, Syria, Iraq, and Bahrain. All 
the Ss had a good command of English, which 
is the language of instruction in the Univer- 
sity. Sixty of the Ss were males. 


1 The author wishes to acknowledge his indebted- 
ness to Aimee Sabbagh and Sumaya Kurani, of the 
Department of Psychology, for scoring the imagina- 
tive stories on the McClelland Test. 


The full PPS was administered to the Ss 
as a class demonstration in the use of the 
questionnaire method. The Ss were told that 
E was interested in collecting norms on uni- 
versity students for help in his counseling 
work and were also told that these norms 
would be available to the class members. The 
Ss were told to use a private identification 
mark on their answer sheets and were asked 
to keep a record of it. A week later the four 
pictures used by McClelland were projected 
on a screen, and each S was given four sheets 
of paper, on each of which were printed the 
four standard questions used by McClelland 
about the pictures which the Ss were asked 
to answer. The instructions and method used 
by McClelland (2, p. 98) were followed 
throughout. The Ss were asked to use the 
same identification mark they had used on 
the PPS the week before. This was also pre- 
sented to the class as a demonstration of pro- 
jective tests which were to be discussed dur- 
ing the following class session. Both these 
tests were given under similar conditions, and 
the situation approximates McClelland’s “neu- 
tral” condition of testing n Achievement. 

The two tests were scored according to the 
methods prescribed by their authors. Fifteen 
scores were available for each subject on the 
PPS. The imaginative stories of the 84 Ss 
were independently scored by two research 
assistants and the interscorer reliability cor- 
relation was .93, which is considered to be 
satisfactory. Only the 69 Ss, 50 males and 19 
females, who completed both tests were in- 
cluded in the final analysis. The age range for 
the 68 Ss who had indicated their age (on 
their last birthday) was 18 to 29, with a 
mean of 21.1 and a standard deviation of 2.2. 
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Edwards’ and McClelland’s Measures of Achievement 


Results and Discussion 


Table 1 shows the means and standard de- 
viations on Achievement for our Ss and those 
of Edwards. The difference between the two 
means is not significant, ¢ being .52, which 
points to the probable validity of this ques- 
tionnaire for a non-American culture. To 
verify this, the Achievement items on the 
PPS were presented to three Arab professors 
in the fields of psychology, Arabic history, 
and Arabic literature. They were asked to 
judge whether any of the items were in any 
way alien to the Arabic culture and whether 
they could be taken as indices of achievement 
for the Arab group. There was full agreement 
among the judges about the validity of these 
items for an Arabic-speaking culture. 

The mean score of the Ss on the imagina- 
tive stories was 2.22 with a standard devia- 
tion of 4.26. No comparable data are avail- 
able for American groups. Even though such 
information is not available, a glance at the 
distribution of the different categories of 
achievement scores in Table 2 points to the 
possibility of cultural differences on certain 
categories. These differences will be discussed 
in a separate paper as they are not directly 
related to the present study. Evidence regard- 
ing the cross-cultural validity of the scoring 
method used is presented by McClelland (2, 
pp. 168-172). 

The correlation between the scores of the 
69 Ss on the PPS achievement variable and 
the McClelland achievement scores is .16. 
Since an r of .16 is not significant, it indi- 
cates an absence of a relationship between 
two methods that claim to measure the same 
variable. 

Like all motives, the achievement motive is 
rather complex. McClelland (2, p. 217) de- 
scribes his score as that of a “composite index 


Table 1 


Mean Achievement Scores of Arab and American* 
Subjects on the PPS 


Group N Mean 





Arabs 84 
Americans 1509 


14.65 
14.38 








* U. S. scores taken from Edwards (1, p. 6). 


Table 2 
Frequencies of Various Achievement Categories* under 
q 


Neutral Experimental Conditions for Both Arab 
and American Subjects” 








Achievement category 


Americans Arabs 





Achievement Imagery (AI) 75 70 
Doubtful Imagery (TT) 49 102 
Unrelated Imagery (UI) 49 112 
Need (N) 35 25 
Instrumental Activity (IT) 31 33 
Positive Anticipatory 

Goal State (Ga +) 14 
Negative Anticipatory 

Goal State (Ga —) 1 
Personal Obstacle (Bp) 
Environment Obstacle (Bw) 10 
Positive Affect State (G +) 16 
Negative Affect State (G —) 4 13 
Nurturant (Nup) : 5 
Achievement Theme (Ach. Th.) 62 58 





® The total number of stories for the Arab group is 276 (69 Ss, 


4 stories each); and for the American group is 156 (39 Ss, 4 
stories each). 


> Data on the American S is taken from McClelland (2, p. 
141). 


of a variety of types of achievement motiva- 
tion” which he demonstrates as having a 
meaningful relationship to varied types of 
behavior. Among the theoretical distinctions 
which he makes between different types of 
achievement is one type that is characterized 
chiefly by a “hope of success” and another 
by a “fear of failure” (2, p. 215). 

This theoretical distinction may possibly 
account for the absence of a relationship be- 
tween the two measures. An inspection of the 
PPS items which measure achievement mo- 
tivation shows that they refer to competition 
with some standard of excellence similar to 
the “hope for success” type of achievement. 
“To do one’s best, to be successful, to accom- 
plish ...todo...,tosolve..., todo 
things better than others . . . etc. . . .” are 
given by Edwards (1, p. 5) in describing his 
Achievement variable. 

Besides the above, responses to the Ed- 
wards items are made to clearly structured 
stimuli, whereas the imaginative stories are 
responses to a variety of cues both internal 
and external, in the environment and the pic- 
tures, as well as to experimentally induced 
cues. This difference points to the probability 
that the two measures tap different levels of 
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achievement, a manifest level, and a latent 
level (2, pp. 196-217), and may account for 
the absence of a relationship between the two 
measures. 

In the light of the findings, and in the in- 
terest of avoiding confusion, it is suggested 
that the two measures of achievement motiva- 
tion be either renamed or qualified. 


Summary 


‘The PPS and the McClelland pictures for 
measuring achievement were given a week 
apart to a group of 84 Arabic-speaking Ss at 
the American University of Beirut. No rela- 
tionship was found between the achievement 
scores of the 69 Ss who completed both tests. 


Levon H. 


Melikian 


The results were discussed in terms of the 
complexity of the achievement motive and, 
also, of the probability that the two methods 
measure the achievement motive at different 
levels. 


Received August 2, 1957. 
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Second-Order Factors in Personality Measurement 
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It is surprising that to date so few pub- 
lished studies on the Sixteen Personality Fac- 
tor Questionnaire have appeared which have 
not emanated from the Laboratory of Person- 
ality Assessment at the University of Illinois. 
The reasons for this lack of popularity are not 
readily apparent, since Cattell and his co- 
workers have published widely and frequently 
(3, 4, 5, 7, 8). Karson and Pool (11) found 
much similarity between the 16 P.F. and the 
MMPI which they interpreted as offering sup- 
port for the construct validity of the 16 P.F. 
test. Their major criticism of the test was that 
the coefficients of equivalence of the factors on 
Forms A and B were not nearly high enough 
to justify their being used as equivalent forms, 
and that even after being corrected for full 
length with the Spearman-Brown prophecy 
formula, the resultant coefficients of half of 
the 16 factors were still not high enough to 
warrant their use for accurate individual per- 
sonality description. 

In a recent paper, Cattell (5) reported 
four second-order personality factors which he 
identified as Anxiety-vs.-Dynamic Integration, 
Extraversion-vs.-Introversion, Cyclothyme-vs.- 
Schizothyme Constitution, and Unbroken Suc- 
cess-vs.-Frustration. These findings were based 
on a sample of 408 normal subjects composed 
of 181 undergraduate students of both sexes 
and 227 U.S. Air Force young men in train- 
ing. The aim of the present study was to at- 


1 The data for this study were collected and the 
statistics computed while both authors were at the 
School of Aviation Medicine, USAF. 

2 A summarized version of this paper was presented 
at the 1957 meetings of the APA in New York City 
by the senior author. 


tempt to replicate the second-order factors de- 
scribed by Cattell in order to determine the 
degree of stability of these factors over differ- 
ent samples. From a theoretica! viewpoint, if 
different samples resulted in yielding unre- 
lated factor structures, then the argument (4, 
p. 3) that factors actually correspond “‘to psy- 
chological functional unities rather than to 
mere mathematical conveniences” would be 
highly questionable. 


Sample 


The sample consisted of 71 USAF officers 
who were referred to the School of Aviation 
Medicine for medical and psychological evalu- 
ation. The average age was 33, with a range 
of 20 to 39 years, and the mean number of 
years of education was 14, with a range of 12 
to 16 years. The mean Full Scale IQ on the 
Wechsler Adult Intelligence Scale was 121.98, 
with a standard deviation of 6.50. The pa- 
tients were referred for evaluation between 
August, 1955 and July, 1956. The sample em- 
ployed in this research is believed to be typi- 
cal of the flying personnel who are referred 
for psychiatric consultation in the U.S. Air 
Force. 

None of the patients was psychotic, but 
almost all were manifesting some disturbance 
in functioning efficiency which was impairing, 
at least temporarily, their ability to perform 
their duties as USAF flying personnel. Upon 
clinical examination, which included psychi- 
atric and psychological evaluation, there was 
general agreement that anxiety and its dis- 
rupting effects usually played a key role in 
the etiology of the patients’ symptomatology, 
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Table 1 


Intercorrelations* of Factor Scores on the Sixtcen P.F. Test 


Score A B Cc E I 





02 12 13 


—.12 .08 
22 


4 


—.12 


ROA mMEAws 


le ee 


Forms A and B Combined (N = 71) 








* For significance at the .05 level of confidence a Pearson r must equal .24. 


even though an actual incapacitating, diag- 
nosable neurosis was seldom present. It should 
be remembered that the sample was composed 
of mentally and physically superior experi- 
enced military flying officers, many of whom 
had successful combat records in either World 
War II or Korea. 


Method and Results 


The 16 P.F. test (Forms A and B) was ad- 
ministered as a part of the routine diagnostic 
psychological test battery. The two forms of 
the test were alternated so that about one-half 
of the sample took Form A first, while the 
other half had Form B initially. Both forms of 
the test were usually administered on the same 
day, one in the morning and the other in the 
afternoon. The tests were scored according to 
the directions given in the handbook. The raw 
scores were converted into standard scores by 
using the sample of male college students and 
high school seniors whose average age was 21 
as the reference group. Pearson coefficients of 
correlation among the standard scores for the 
16 factors were computed to achieve the ma- 
trix of intercorrelations presented in Table 1.° 


8 Although this procedure for obtaining the matrix 
of intercorrelations is admittedly not as precise as 
when one begins with the cosine matrix among the 
siraple structure primary factors and then inverts this 
matrix to obtain the correlations among the factors, 


The factor analysis was then done, using 
Thurstone’s centroid method (12). As guides 
for when to stop factoring, criteria suggested 
by Burt and Banks (1) and Guilford and 
Lacey (10) were used. After six factors had 
been extracted, an estimated 109% of the 
common factor variance had been accounted 
for, and the sixth factor residuals ranged in 
size from — .068 to .093. Rotation to simple 
structure was then attempted without regard 
to positive manifold. As a guide in rotation, 
the goal was to maximize the number of vari- 
ables with zero projections on the reference 
vectors. The factors were rotated blindly two 
at a time by means of graphs and six inde- 
pendent, second-order factors were extracted.* 
The new factor loadings and their communali- 
Cattell has previously employed this method (4, p. 
18). On this same issue, Cattell (5, p. 412) states: 
“However, it must be granted that where factors are 
already very purely measured by existing tests, with 
no systematic bias and with effective suppression of 
peer factors in each factor score, the factoring of 
actual factor scores, ie., of ome variable for each 
factor, may yield a reasonable approximation to 
second-order factors. This seems to have happened 
with the 16 P.F. test in the large Air Force study.” 

*The centroid factor matrix, the matrix of sixth 
factor residuals and the direction cosines have been 
deposited with the American Documentation Insti- 
tute. Order Document No. 5586, remitting $1.25 for 
microfilm or $1.25 for photocopies. 
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Table 2 
Rotated Second-Order Factor Loadings 


First 
order 
factor 4 [ 4 h? 


02 .3951 

O07 .2187 

ot 08 .8739 

. rir 37 ~.5410 

04 -02.. 03 .6175 

08 O01 08 .3895 

-04 19 —. 00 8778 

01 03 —. 0 5244 

00 —.08 —-. 08 .6038 

00 —.23 —: 32 6757 

30 «(07 27 .1849 

81 4 18 12 ‘ 09 8347 
ma. 4 A7 —17 —.02 —.05 .3687 
21 wy 0S —.47 26 —.01 .3881 
54 ; 03 O01 34 04 4459 
80 21 19 —04 —.04 .7270 





ties after rotation of axes are presented in 
Table 2. 


Description of the Factors 


The chief indications of the nature of Fac- 
tors 1 through 6 are presented below with a 
contingent title given to each factor. 


Factor 1. Anxiely-vs.-Dynamic Integration 
Symbol 16 P.F. Handbook Title Loading 
Cc Ego Strength — 84 
oO Anxious Insecurity 81 
Q, Nervous Tension 80 
M Hysteric Unconcern 72 
Paranoid Schizothymia 65 
Will Control & Character Stability —.54 
Radicalism 34 
Adventurous Autonomic Resilience —.33 
Emotional Sensitivity 32 


This factor shows a marked resemblance to 
the second-order factor reported by Cattell 
(5) on which Q3, O, Q4, L, C, and H loaded 
heavily. He identified this factor as Anxiety- 
vs.-Dynamic Integration and we have adopted 
his title. Further support for this identifica- 
tion is forthcoming from an earlier study by 
Cattell (4) in which a factor described by him 
as Anxiety-vs.-Self-Assuredness was reported. 
The 16 P.F. test factors, 04, O, O3, C, L, M, 
and H all loaded on this factor. 

Additional support is provided by Karson 
and Pool (11) who reported that the MMPI 


Symbol 
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psychasthenia (Pt) scale correlated signifi- 
cantly with factors C, O, Q4, M, L, Qs, H, 
and J. Moreover, the factors with significant 
loadings on Factor 1 correlated more highly 
with the Pt scale than any of the other 13 
MMPI scales, with the possible exception of 
Sc which correlated .89 with Pt for the pres- 
ent sample (11). High correlations between 
Pt and the Taylor Anxiety scale have been 
reported by Eriksen and Davids (9) and by 
Brackbill and Little (2). 


Factor 2. Extraversion-vs.-I ntroversion 


16 P.F. Handbook Title Loading 
E Adventurous Autonomic Resilience a4 
F Surgency 

E Dominance 

A Cyclothymia a 

O Anxious Insecurity —.3 


Factor 2 also shows much similarity to 
another of Cattell’s second-order factors on 
which M, F, A, Q;, Qe, and H had significant 
loadings. He identified this factor as Extra- 
version-vs.-Introversion, and once again we 
have adopted his title (5). Cattell reports 
that in still another study (5, p. 416) the Z 
factor (which we found) also enters in and, 
moreover, that an alternative rotation which 
was considered for his Factor 2 on 4 would 
also have done so again. The reasons why Q, 
did not load on Factor 2 in the present study, 
while O did, are unknown, although it is be- 
lieved that sampling differences may be re- 
sponsible for these variations. 

Karson and Pool (11) provide additional 
support for the identification of Factor 2 in- 
asmuch as they reported significant correla- 
tions between certain 16 P.F. scores and 
particular MMPI scale scores which appear 
relevant to this identification. 

The present findings do not, however, sup- 
port Cattell’s assertion (5, p. 416) that the 
“relatively low loadings of the more heredi- 
tarily determined H and A factors suggest 
that Factor 2 is an environmentally deter- 
mined pattern.” Factor H had the highest 
loading of all in the present study. Neverthe- 
less, his conclusion (5, p. 417) that “There 
now seems ground for recognizing extraver- 
sion-introversion as something more than a 
correlation cluster or surface trait, and per- 
haps for specializing the term for this measur- 
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able second-order factor” is in agreement with 
the present findings. 


Factor 3. Unbroken Success-vs.-Frustration 
Symbol 16 P.F. Handbook Title Loading 


QC: Conservatism — 47 
B Genera! Intelligence 38 
N Lack of Sophistication (Naivete) — 30 
Cc Ego Strength 27 


Factor 3 shows some resemblance to Cat- 
tell’s factor of Unbroken Success-vs.-Frustra- 
tion inasmuch as Q,; and WN are present in 
both patterns as well as high intelligence from 
an objective behavior study (5, p. 417). 

Karson and Pool (11) demonstrated, how- 
ever, that for the present USAF officer sam- 
ple, Factor B was not a valid measure of 
intelligence. They also reported significant 
correlations between B and Mf and J, re- 
spectively, and between Q, and F, Hy and 
Mf from the MMPI. This pattern of relation- 
ships makes Factor 3 difficult to identify. The 
present authors have chosen to adopt Cattell’s 
title of Unbroken Success-vs.-Frustration, al- 
though a rival hypothesis of Spontaneity-vs.- 
Inhibition was also entertained. 


Factor 4. Independence-vs.-Dependence 
Symbol 16 P.F. Handbook Title Loading 
Q: Lack of Resolution — A7 


A Cyclothymia 34 
E Lack of Dominance (Submission) — 33 


Factor 4 has some similarity to a second- 
order factor (4, p. 40) of Schizothyme Te- 
nacity-vs.-Cyclothymia on which Factors E, 
F, H, A, and Qe2 loaded. This identification is 
difficult to reconcile, however, with 2 more 
recent second-order factor again identified by 
Cattell (5, p. 417) as Cyclothyme-vs.-Schizo- 
thyme Constitution on which Factors A, /, 
N, Qs, and C loaded significantly. 

Karson and Pool (11) reported significant 
correlations between Qe and Si and Mf, be- 
tween A and Si, and between E and Mf, Si, 
and Sc, respectively. In view of this pattern 
of interrelationships, Factor 4 has been tenta- 
tively identified as Independence-vs.-Depend- 
ence. The handbook (8) for the 16 P.F. lends 
support for this hypothesis by its description 
of the Q2 — person (8, p. 11), the A — and 
A + person (8, p. 6), and Factor EZ as well 
(8, p. 7). 
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Factor 5. Sociopathic Deviancy 
Symbol 16 P.F. Handbook Title 


G Lack of Super-Ego Strength 

L Lack of Paranoid Schizothymia 
0; Lack of Will Control 

Q2 Lack of Resolution 


Loading 


—.61 
— Al 
—.34 
—.26 


The identification of this factor is especially 
difficult, principally because the two factors 
with the highest loadings, namely G and L, 
are among the ones about which least is 
known. Karson and Pool (11) reported that 
none of the MMPI scale scores correlated sig- 
nificantly with Factor G which, strangely 
enough, had its highest correlations with Pd 
and K. These findings as well as the authors’ 
clinical experience with this test over a two- 
year period raise some doubt as to the identi- 
fication of G as a factor of “effective” super- 
ego pattern (6). Factor L is also puzzling; it 
did not correlate with Pa; however, it corre- 
lated more highly with Pt and Sc than did Pa. 
Moreover, the high loading of LZ on the sec- 
ond-order factor of Anxiety-vs.-_Dynamic In- 
tegration, which has been found consistently 
(4, 5, 7), does not appear to have been satis- 
factorily explained on a theoretical basis al- 
though such an explanation has been at- 
tempted (7, p. 230). 

Two hypotheses are suggested for the 
identification of Factor 5; first, that it is a 
factor of sociopathic deviancy and, second, 
that it represents a lack of obsessive-compul- 
sive personality characteristics in our sample. 
The latter interpretation would necessitate a 
reexamination of Factor Z with emphasis 
given to its anxiety and rigidity aspects rather 
than to paranoid suspicion. At any rate, it is 
encouraging to see some relationships emerge 
between the highly esoteric world of factor 
analysis and the more familiar conceptualiza- 
tions of psychiatric nosology. 


Factor 6. Latent Homosexuality (Passive) 


Symbol 16 P.F. Handbook Title Loading 


I Emotional Sensitivity 60 
E Lack of Dominance (Submission) —.37 
M Hysteric Unconcern 32 
N Lack of Sophistication (Naivete) —.27 


The particular constellation of loadings on 
this factor suggests the hypothesis of latent 
homosexuality of a passive nature. Some sup- 





Second-Order Factors in Personality Measurement 


port for this identification comes from the sig- 
nificant correlation reported between / and 
Mf (11) and from the description of the J + 
person presented in the 16 P.F. handbook (8, 
p. 9). This factor may serve to throw light on 
the psychodynamics of military flyers, par- 
ticularly with regard to the frequently made 
clinical observation of “masculine strivings”’ 
for this group. 

Some mention should be made of the fact 
that Cattell (5) used an oblique solution in 
obtaining the four second-order factors which 
he described, while the present study reports 
six independent factors. These differences not- 
withstanding, the present findings appear to 
be encouraging for those who believe in the 
stability and reality of factorial solutions. 
Why two of Cattell’s factors correlated .43, 
while the same two factors were not signifi- 
cantly correlated in the present study, is not 
readily apparent although this discrepancy 
may be a function of methodological differ- 
ences, sampling differences, or both. 


Summary 


In order to evaluate the stability of fac- 
torial solutions over different samples, an at- 
tempt was made to replicate the second-order 
personality factors reported by Cattell. Forms 
A and B of the 16 P.F. were administered to 
a sample of 71 USAF officers on flying status, 
who had been referred to the School of Avia- 
tion Medicine, USAF, for medical and psy- 
chological evaluation. The test scores were 
punched on IBM cards, and a correlation 
matrix was derived using Pearson product- 
moment correlation coefficients. The factor 
analysis was done using Thurstone’s centroid 
method. Rotation to simple structure by 
means of graphs yielded six orthogonal, sec- 
ond-order personality factors. These were de- 
scribed as follows: 1. Anxiety-vs.-Dynamic 
Integration; 2. Extraversion-vs.-Introversion ; 
3. Unbroken Success-vs.-Frustration; 4. Inde- 
pendence-vs.-Dependence; 5. Sociopathic De- 
viancy; 6. Latent Homosexuality (Passive). 
The first three factors named above appear to 
be replications of three of the four second- 
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order factors reported by Cattell and his titles 
for them were used. The remaining differ- 
ences in the nature and number of second- 
order factors found between the two studies 
can probably be explained at least partially 
by methodological differences, by the varying 
degree of simple structure actually attained, 
and by differences in age, status, and sex as 
well as in the personality characteristics of 
the two samples. 
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The purpose of this paper was to investigate 
the Minnesota Multiphasic Personality In- 
ventory (MMPI) performance of two groups 
of disabled veterans to determine whether 
there were areas of significant difference in 
terms of personality functioning. Particular 
attention was devoted to the K correction 
scale to determine if significant differences 
existed in the “test-taking attitude” of veter- 
ans with psychiatric as compared to those of 
organic disabilities. 

The setting of the study was a VA coun- 
seling service for seriously disabled veterans. 
The Ss consisted of a group of 50 veterans 
having VA ratings for tuberculosis and gun- 
shot wounds and a like number of veterans 
with VA ratings for schizophrenia. The Ss 
were selected at random from the population 
of male counseling cases seen within the past 
three years. 

The MMPI scores of the psychiatric (P) 
group were compared with the organic (O) 
group, and both of these groups were com- 
pared with the MMPI standardization group 
on several of the scales, and with a group of 
nondisabled “normal” veteran controls. 

Examination and statistical analyses of the 
data revealed the following: 


1An extended report of this study may be ob- 
tained without charge from Joseph J. Motto, 8635 
Elmira, Detroit 4, Michigan, or for a fee from the 
American Documentation Institute. Order Document 
No. 5579, remitting $1.75 for microfilm or $2.50 for 
photocopies. 


1. MMPI scales, Pt and Sc, significantly 
differentiated those of organic disabilities 
from those of schizophrenic disabilities. 

2. In terms of personality characteristics 
measured by the MMPI, the organic group 
was more similar to the psychiatric group 
than they were to a group of nondisabled, 
“normal” veteran controls. 

3. The presence of an organic disability re- 
sulted in Hs scores which were not signifi- 
cantly higher than those produced by the psy- 
chiatric group. 

4. The K scale functioned as predicted by 
test authors and conformed to their rationale 
of the suppressor variable when applied to 
those with organic disabilities, the sole excep- 
tion being the Mf scale where it appears a K 
correction weight is needed for this group. A 
similar trend to an inverse relationship be- 
tween K and Mf was seen in the P group. 

5. There were indications of the need for: 
an additional K correction weight for veter- 
ans with psychiatric disabilities on Sc, Ma, 
and Pa scales. 

6. Compared to the control group, both 
disability groups produced higher F scores. 
The organics differed from the normals by 
the presence of an elevated neurotic triad, 
whereas the psychiatric group differed from 
the normals by elevations of both the neu- 
rotic and the psychotic triads. 

7. K was found to have a high positive re- 
lationship with L and a significant inverse re- 
lationship with F. 


Brief Report. 
Received March 10, 1958. 
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Several investigators (1, 2, 3) have made 
use of one or both of two experimental scales 
derived by us in the course of a cluster analy- 
sis of the Hysteria scale of the MMPI. The 
purpose of this note is to describe briefly the 
development of these scales and to indicate 
some of their correlates in clinical and experi- 
mental work. 

As part of the study of differential person- 
ality patterns in diagnostic center and general 
medical patients in a Veterans Administration 
hospital, it was observed that the Hysteria 
scale score was the highest point on the 
MMPI profiles of both groups. In both, the 
mean J score exceeded 70 despite the fact 
that the general medical patients were hos- 
pitalized for nonpsychiatric disorders. The ob- 
servation of an elevated Hy score in general 
medical patients is, of course, a common one. 
On the assumption that these elevated scores 
had a different meaning in a nonpsychiatric 
population than in one with some history of 
psychologic disorder, we initiated a cluster 
analysis of the matrix of tetrachoric intercor- 
relations among the 60 Hysteria scale items. 
The patient population on which the correla- 
tions were derived consisted of 60 diagnostic 
center neuropsychiatric subjects and 90 ran- 
dom admissions to the general medical wards. 

Nine relatively clear-cut clusters were ob- 
tained. Five of these clusters had moderate to 
high positive correlations among themselves 
and zero or slightly negative correlations 

1Now with the Training and Standards Branch, 


National Institute of Mental Health, Bethesda, 
Maryland. 


with the remaining four. The latter group of 
clusters had, conversely, moderate to high 
positive correlations within itself. The findings 
suggested, of course, that two primary di- 
mensions underlay the cluster structure. The 
group of five clusters (comprising 23 of the 
Hy items) was composed for the most part of 
statements which consist largely of physio- 
logic symptoms to which the subjects “ad- 
mitted.” This group was labeled the Admis- 
sion (Ad) scale. 

The second scale was composed of the 19 
items falling into the group of four clusters. 
These items consisted for the most part of 
items scored if the subject denied their va- 
lidity for him. In content, they were state- 
ments about poor interpersonal relations, feel- 
ings of hostility, suspicion, and the like. The 
second cluster was therefore called the Denial 
(Dn) scale. As a final step, biserial correla- 
tions were computed between each of the re- 
maining 18 Hysteria scale items and the Ad 
and Dn scale scores. Items were added to the 
scale with which they had the highest correla- 
tion. Two items correlated zero with both 
scales and were discarded. 


The item composition of the two scales (as they 
are numbered in the group form booklet of the 
MMPI) is as follows: Ad (32 items)—Items 2, 3, 7, 
8, 9, 55, 103, 107, 128, 137, 153, 160, 163, 174, 175, 
188, 190, 192, 230, 243 scored if marked False, and 
items 10, 23, 32, 43, 44, 47, 76, 114, 179, 186, 189, 
and 238 scored if marked True. Dn (26 items)— 
Items 6, 12, 26, 30, 71, 89, 93, 109, 124, 129, 136, 
141, 147, 162, 170, 172, 180, 201, 213, 234, 265, 267, 
279, 289, 292 scored if marked False, and item 253 
scored if marked True. All 58 items are scored in the 
same direction as they are scored on the Hy scale. 
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Correlations between the new scales and 
other MMPI scales for two different groups 
of subjects,? V = 100 in each group, are given 
in Table 1. The Kuder-Richardson reliability 
coefficients based on all 200 cases were Ad = 
.83 and Dn = .75. Ad has its highest correla- 
tion with the MMPI Hs scale, a not too sur- 
prising observation in view of the 18-item 
overlap between the two scales. In most popu- 
lations, Hs and Ad could be substituted one 
for the other. The Dn scale has as its highest 
correlation that with the MMPI K scale, with 
which it shares nine items in common. 

In the clinical application of these scales, 
it has been found that high scores on the Dn 
scale generally describe the uninsightful, the 
anti-intraceptive, the morally virtuous indi- 
vidual; high scores on Dn are also seen with 
“muted” or pseudo-normal profiles. Extreme 
scores on the Ad scale, on the other hand, 
depict the person in general psychologic dis- 
tress who complains and is anxious about his 
somatic functions and who also reports dis- 
turbances in his object relationships. Such 
high scores on Ad are usually seen with over- 
all elevated MMPI profiles. When both scales 
have moderate to high scores, equivalent to 
an elevation on the original Hy scale, there is 
the distinct clinical impression of the pres- 
ence and operation of conversion neurotic dy- 

2The subjects were tested as part of a project 


supported by a grant (M-1132) from the National 
Institute of Mental Health. 


and Jerome Fisher 


Table 1 
Correlations of Admission and Denial Scales with 
MMPI Scales for Two Populations 


Patients referred to 
psychiatric service 


General medical 
patients 


Ad 


MMPI 


scale 


Dn Ad Dn 
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namics in the behavior and history of the in- 
dividual. 
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Psychoanalytic theorists such as Fromm- 
Reichman (3), Rosen (6), and Arieti (1) 
have hypothesized that schizophrenia has its 
origin in the early relationship between 
mother and child. They imply that the mother 
of a schizophrenic child has certain destruc- 
tive types of attitudes toward children which 
she conveys to the child through her behavior 
and verbalizations. Unfortunately, we must 
rely on retrospective evidence bearing on this 
hypothesis. Tietze (9) interviewed a group of 
mothers of schizophrenics and characterized 
them as overanxious, obsessive and domineer- 
ing. However, no control group was compared 
with these mothers, and one may question the 
objectivity of an open interview with a mother 
already identified as the mother of a schizo- 
phrenic. 

The need for more objective techniques 
to assess parental attitudes encouraged other 
researchers to develop objective instruments. 
The typical parental attitude survey uses 
items which are stated in the form of gener- 
alities and clichés in order to make them 
sound socially acceptable. The parent is asked 
to state whether she mildly or strongly agrees 
or disagrees with these items. Freeman and 
Grayson (2) used the Shoben Parent-Child 
Attitude Survey in comparing 50 mothers of 
schizophrenics with 50 control mothers. They 


1 The authors are grateful to Donald Moore and 
Margaret Morgan for their permission in making 
the facilities of Carter Memorial Hospital available 
for this research. Harry Brittain assisted in the 
analysis of the data. 


found the mothers of schizophrenics scored 
significantly higher on the Possessive and Ig- 
noring scales (on the latter scale they ac- 
cepted the .10 level of confidence) than the 
control mothers. Unfortunately, they did not 
control educational or socioeconomic differ- 
ences between their groups. From the way in 
which the control group data was gathered, 
it is likely that differences existed in the edu- 
cational levels of the groups. Furthermore, 
since they relied on volunteer mothers so- 
licited by volunteer students, various selec- 
tive factors could have determined the scores 
in the control group. 

Mark (4), in a more carefully controlled 
study, found that a significant number of 
attitude items distinguished a group of 100 
mothers of schizophrenics from 100 control 
mothers. However, McFarland (5) attempted 
to replicate this study with the same items 
and failed to find a significant number of 
items distinguishing the two groups. 

The purpose of the present study was to 
retest the hypothesis that mothers of schizo- 
phrenics have more severe (controlling and 
rejecting) parental attitudes than mothers of 
normals. 


Procedure 


The Parental Attitude Research Instrument 
(PARI) was the main instrument used in this 
study. This test was developed by Schaefer 
and Bell (7), with items modeled after the 
types used by Mark (4) and Shoben (8). 
An attempt was made to develop a number of 
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short, but internally consistent, scales meas- 
uring specific types of attitudes, to permit 
somewhat broader interpretations than can be 
made from post hoc item analyses. The test 
consists of 23 scales, each of which measures 
some attitude theoretically relevant to per- 
sonality development in children. Normative 
data are furnished by Schaefer and Bell (7), 
and in a previous article by Zuckerman, Rib- 
back, Monashkin, and Norton (11). A factor 
analysis of the scales using a heterogeneous 
sample, including the group reported in this 
paper, can be found in the latter article. The 
factors found in this study were used to de- 
rive summary scores in the current study by 
weighing scales roughly by their factor load- 
ings. Factor A seems to be an Authoritarian- 
Controlling factor which loads heavily on 16 
of the scales. Factor B seems to be a Hos- 
tility-Rejection factor which loads on three 
scales. Factor C seems to be a Democratic 
Attitude factor which loads on three scales. 


Subjects 


Forty-seven mothers of male and female 
patients diagnosed as schizophrenic in an 
acute treatment hospital were given the 
PARI. Ratings of pathology on 30 cases of 
mixed diagnoses were available from a previ- 
ous study and provided some information on 
the reliability of diagnoses at the institution. 
A seven-point rating scale ranging from mildly 
neurotic to severely psychotic was used in 
rating the case-history summaries of the pa- 
tients. Ratings were global, but factors con- 
sidered included severity of symptoms, chro- 
nicity of the disorder, response to treatment, 
and premorbid adjustment. Two of the au- 
thors made the ratings independently, and 


Table 1 


Relation Between Case History Rating and 
Psychiatric Diagnosis 





Case history rating 





Psychiatric diagnosis 13 3545 57 





Nonschizophrenic 8 2 1 
Schizophrenic 0 2 17 





Note.—Chi square, corrected (combining cells) = 15.56. 
df = 1,9 < .001, 
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Table 2 
Composition of Mothers of Schizophrenic and Mothers 
of Normal Groups 


Normals’ 
mothers 


Schizophrenics’ 
mothers 
Level of Mean 
education N age 





52.4 
48.0 
50.7 


49.6 


Grammar school 
High sc hool 
College 


All cases 


their ratings correlated .80. The agreement 
between the raters and the diagnosis assigned 
the patient is summarized in Table 1. It can 
be seen that there is high agreement between 
psychiatric diagnosis and the ratings except 
in the middle of the scale where the border- 
line cases would be expected to fall. Although 
there is probably some contamination between 
the two sets of data, the experimenters felt 
justified in using the psychiatric diagnosis of 
schizophrenia as a criterion, recognizing that 
wide behavioral differences may exist within 
the disorder diagnosed as “schizophrenia.” 

A group of 47 mothers of normals was de- 
rived from the older age range of the normal 
sample (11) and from some additional nor- 
mals gathered at a later date. The control 
group consisted of mothers given the PARI 
at church social and nursery school meetings, 
with some mothers of students from a night 
extension college. 

Since education of the mother seems to 
influence most of the scales and one of the 
factors (11), it was put into the analysis of 
variance design using three levels: grammar 
school, high school, and college. Age differ- 
ences were then compared between diagnostic 
groups across levels. At the grammar school 
level, there was a significant age difference; 
therefore, the five oldest mothers of schizo- 
phrenics and the five youngest mothers of 
normals were discarded. The resultant sam- 
ples, containing 42 mothers in each group, 
are described in Table 2. Sex of the patient 
was considered before grouping mothers of 
male and female schizophrenics together. 
Twenty-one mothers of male schizophrenics 
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were compared with 21 mothers of female 
schizophrenics. A significant difference was 
found on only one scale. Therefore, the 
groups were combined. Other variables con- 
sidered were the child’s diagnostic subclassifi- 
cation, paranoid vs. other schizophrenics, and 
the time interval between the admission of the 
patient and the administration of the test to 
the mother. Neither of these variables seemed 
to markedly influence the average parental 
attitude scores. 


Results and Discussion 


The results of the analyses of variance on 
Factors A, B, and C, and on 20 of the indi- 
vidual scales are reported in Table 3. Only 
one of the differences between groups was sig- 
nificant. The mothers of schizophrenics scored 
significantly lower on Scale 8, strictness, than 
the controls. However, 1 out of 23 compari- 
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sons may be considered a finding attributable 
to chance. The differences between education 
levels were significant for Factor A and 14 of 
the 20 individual scales. The interaction ef- 
fects were significant for Factor A and eight 
of the individual scales. An analysis of the 
interaction effect in Factor A can be seen in 
Table 4. At the grammar school level, the 
mothers of schizophrenics tend to score higher 
than the controls, and at the high school level 
they scored significantly lower than the con- 
trols. No differences were found at the college 
level. The results on the individual scales tend 
to follow the same pattern. If we had included 
the 10 subjects discarded to make age equiva- 
lent at the grammar school level, the interac- 
tion effect would have been even more pro- 
nounced. 

The interaction effects are difficult to inter- 
pret. They may indicate that the way the 


Table 3 


Analysis of Variance Results in Comparisons of Mothers of Schizophrenics and Mothers of Normals* 


Score or scale 


Child’s 


diagnosis 


Mother’s 


education Interaction 





Summary Scores 


A—aAuthoritarian-Control 
B—Hostility-Rejection 
C—Democratic Attitudes 


Scales 

2—Fostering Dependency 
3—Seclusion of the Mother 
4—Breaking the Will 
5—Martyrdom 

6—Fear of Harming the Baby 
7—Marital Confli-t 
8—Strictness 

9—Irritability 

10—Excluding Outside Influences 
11—Deification 

12—Suppression of Aggression 
13—Rejection of Homemaking Role 
15—Approval of Activity 
16—Avoidance of Communication 
17—Inconsiderateness of Husband 
18—Suppression of Sex 
19—Ascendance of the Mother 
20—Intrusiveness 
22—Acceleration of Development 
23—Dependency of the Mother 


5.80** 
8.80** 
13.07** 
12.21** 
2.76 
3.99* 
2.80 
1.44 
10.51** 
11.07** 
4.24* 
3.82 
10.30** 
11.75** 
11.13** 
9.78** 
4.10* 
3.53* 





* F ratios of less than one are indicated by dashes. 
* Significant at or below the .05 level. 
** Significant at or below the .01 level. 
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Table 4 


Comparisons of Means of Mothers of Schizophrenics 
and Mothers of Normals by Levels on 
Summary Score A 








Schizo- 
phrenics’ Normals’ 
mothers mothers t ? 


Level of 
education 





242.09 
191.58 
172.86 


214.91 1.88  .05-.10 
219.50 2.86 01 
155.86 94 3-4 


Grammar school 
High school 
College 





mothers of schizophrenics respond to the test 
depends on their familiarity with the cultural 
expectations for parental attitudes, which, in 
turn, may depend on their education. The 
samples at each level are not adequate to 
justify a conclusion. There is, however, a 
greater relationship between education and 
parental attitudes in mothers of schizophrenics 
than in the controls. The correlation between 
Factor A and education in the original 47 
mothers of schizophrenics was — .632. In the 
original 47 controls it was only — .287. The 
difference between these two correlations was 
significant beyond the .05 level. 

The hypothesis of a difference between pa- 
rental attitudes of mothers of schizophren- 
ics and other mothers was not confirmed. It 
would be difficult to disprove this hypothesis, 
since the possibility always remains that other 
instruments or techniques may reveal a dif- 
ference. Toms (10) compared 30 mothers of 
male schizophrenics and 30 control mothers, 
using a variety of instruments including an 
interview, vocabulary test, the MMPI, and a 
short TAT. Unfortunately, there was a dif- 
ference in education between the two groups 
which was not taken into account in the 
analyses. However, the general conclusion 
was that the hypotheses of personality dif- 
ferences between mothers of schizophrenics 
and other mothers were not confirmed from 
any of the techniques. One interesting result, 
which bears on our speculations about our 
interaction effects, was that the mothers of 
schizophrenics scored significantly higher on 


the two MMPI scales measuring defensive- 
ness (L and K). These findings suggest that 
further research on the “schizophrenogenic 
mother” hypothesis might do better using 
tests which control for social desirability and 
response suppression. 


Summary 


It was hypothesized that mothers of schizo- 
phrenics have more severe (controlling and 
rejecting) parental attitudes than mothers of 
normals. Forty-two mothers of schizophrenic 
patients and 42 control mothers were com- 
pared on the Parental Attitude Research In- 
strument. The hypothesis was not supported, 
although significant interactions were found 
between the two groups and levels of educa- 
tion. 
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According to the psychology of personal 
constructs as set forth by Kelly (2), each in- 
dividual has a conceptual system (an organ- 
ized set of personal constructs) with which he 
gives structure to the world around him. Basic 
to the theory is the constant attempt by each 
individual to anticipate events in his sur- 
rounds. Hence, man is thought of in some- 
what the same light as the scientist in his 
continual process of prediction and validation. 
Just as the scientist begins with a theory and 
makes his predictions on the basis of the con- 
cepts from that theory, man has a theory of 
life composed of bipolar personal constructs. 
His predictions are made along the lines of 
these personal constructs. 

In applying this theoretical framework, 
Kelly and his associates have developed a 
technique designed to elicit the construct 
framework of any given individual. This tech- 
nique is the Role Construct Repertory Test 
(RCRT). 

With the RCRT, the subject (S) thinks 
about certain people he knows—people mean- 
ingful to him and who represent a variety of 
types. These people are referred to as “ele- 
ments” and are determined by a role-title list. 
The S assigns people he knows to fulfill the 
role titles. Examples are as follows: 


. Write your own name in the first blank here. 

. Your closest present friend of the same sex as 
yourself. 

. The most successful person whom you know 
personally. 


The S is then given these elements three at 
a time and asked to think about the three 
people and decide some important way in 
which two are alike and which sets them 
off from the third person. The contrasts are 
listed. Kelly assumes they are important con- 
structs in the individual’s personal construct 
system. 

Certain other assumptions underlie the 


RCRT. One of these is the assumption of the 
“representativeness of the elements.” “If the 
test is to indicate how the subject develops 
his role in the light of his understanding of 
other people, it is necessary that the other 
people appearing as elements in the test be 
sufficiently representative of all the people 
with whom the subject must relate his self- 
construed role” (2, p. 230). On the basis of 
this assumption, Kelly constructed a list of 
role titles; i.e., self, mother, father, ex-flame, 
pal, authorities, etc. 

This assumption is the focus of concern in 
this paper. Kelly assumes representative ele- 
ments (representative role titles) are neces- 
sary to elicit the most crucial personal con- 
structs. The present research began with the 
simple question: “What would happen if the 
representative elements assumption were vio- 
lated—and why?” 

The problem was further limited to the 
question of consistency of construct labels 
with and without the fulfillment of the above 
assumption. Hunt (1) has previously investi- 
gated certain aspects of consistency with the 
RCRT. He examined Ss on two occasions 
with a one-week time interval. Essentially dif- 
ferent role-title lists were used each time. 
However, the two lists were equated as to 
sex, relatives, authority figures, etc. With his 
technique of determining equivalence of con- 
structs, he found the average percentage of 
agreement in constructs used on the two 
protocols to be 69% with a standard devia- 
tion of 6%. 

In Hunt’s research, the assumption of rep- 
resentative elements was fulfilled in both 
forms of the test. In the present research, we 
are concerned with the question of whether 
or not similar results will occur if the as- 
sumption is violated in one form. The pre- 
dictions were as follows: Construct labels 
elicited by the RCRT will be consistent 
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through time when a representative role-title 
list is used. Consistency will also occur when 
the role-title list is nonrepresentative, in this 
case, homogeneous. 


Procedure 


The Ss for this investigation included 20 
psychiatric aides randomly divided into two 
equal groups. One of these groups is referred 
to as the “representative elements group 
(REG)”; the other, the “nonrepresentative 
elements group (NREG).” Constructs were 
elicited from the Ss in the REG in the same 
manner as the Role Construct Repertory Test 
as described by Kelly. (In the present form 
there were 19 role titles and 22 sorts of 3 ele- 
ments each.) All Ss were re-examined with 
the same technique after a three-month in- 
terval. The Ss in the NREG were treated in 
the same manner with a singular exception— 
the role-title list was not used. In its place, 
each S was merely asked to list the names of 
19 personal friends. 

According to the hypothesis, we would ex- 
pect a significant number of constructs elicited 
on the first administration to be repeated on 
the second administration. Such would apply 
to both groups. Each construct for each S on 
the first administration was evaluated indi- 
vidually. If either pole of a given construct 
were repeated verbatim anywhere in the en- 
tire repertory elicited on the second adminis- 
tration, this was considered as repetition. Bi- 
nomial expansion was utilized to establish 5% 
significance levels for number of constructs 
repeated and number of Ss repeating a sig- 
nificant number of constructs. 


Results 


All protocols were evaluated by comparing 
both protocols for each S. If either or both 
poles of any given construct on the first 
administration were repeated verbatim on 
the second administration, a replication was 
scored. Thus the potential range of scores 
was 0-22. Applying the binomial expansion, 
16 of 22 replications would be significant at 
the .05 level. 

In the representative elements group, nine 
of the ten Ss repeated significant numbers of 
constructs (p < .02). In the nonrepresenta- 
tive elements group, two Ss did so (p > .20). 
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Table 1 


Distribution of Replicated Constructs and 
Significant Subjects 





REG NREG x 








170 
50 


Constructs Repeated 12 
Constructs Not Repeated 7 
Significant Subjects i) 2 
Nonsignificant Subjects 1 7 





(One S in the latter group had to be dropped 
because he was not available for re-examina- 
tion.) The results seem to indicate a direct 
relationship between representative role titles, 
on which to base constructs, and consistency 
of construct labels. 

Chi square was computed for the two 
groups comparing the number of Ss repeating 
significant numbers of constructs with the 
number repeating insignificant numbers. These 
data are presented in Table 1. 

Certain questions arose at this point of the 
research. Perhaps the most obvious was the 
possibility that these results are the result of 
the role-title list yielding the same people on 
each administration, while a different group 
of friends was used each time. Another ques- 
tion is that of whether or not people, when 
responding to such a test, are responding to 
the types of people described on the role-title 
list rather than or in addition to the particu- 
lar people involved? Lastly, are different types 
of constructs elicited when the elements are 
heterogeneous as opposed to homogeneous? 

In an attempt to clarify the results, the Ss 
in the NREG were re-examined after another 
three-month interval. On this administration, 
Ss used the identical list of 19 friends they 
had produced on the second administration. 
These friends were listed in identical order, 
and the Ss were given identical sorts to elicit 
constructs. In comparing the second to the 
third administration, three Ss repeated sig- 
nificant numbers of constructs (p > .20)— 
substantially the same results as the previous 
administration. 


Discussion 


It would seem apparent, on the basis of the 
above, that fulfillment of the assumption of 
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the representativeness of the elements is a 
matter of importance, at least in terms of 
consistency. Construct labels were consistent 
through time when the assumption was ful- 
filled and were inconsistent through time when 
the assumption was violated. 

It would appear, therefore, that conceptual 
consistency on the RCRT, at least insofar as 
it applies to the construct labels, must be in 
some way a function of the role-title list and 
the types of constructs elicited thereby. 

The reasons can only be conjectured. The 
most likely source of explanation is Kelly’s 
theory itself. The fundamental postulate of 
his system is stated as follows: “A person’s 
processes are psychologically channelized by 
the way in which he anticipates events” (2, 
p. 46). One of the corrolaries of this postulate, 
the “Organization Corrolary,” states: “Each 
person characteristically evolves, for his con- 
venience in anticipating events, a construc- 
tion system embracing ordinal relationships 
between constructs” (2, p. 56). 

According to this corrolary, personal con- 
structs within any individual’s system are ar- 
ranged in a hierarchical structure. “Within a 
construction system there may be many lev- 
els of ordinal relationships, with some con- 
structs subsuming others and those, in turn, 
subsuming still others” (2, p. 57). Subsuming 
constructs are termed superordinal and sub- 
sumed constructs subordinal. 

Relating these corrolaries to the present 
data, it would seem plausible that, when 
thinking of the people represented on the 
role title list, Ss use personal constructs ca- 
pable of describing not only specific indi- 
viduals, but also describing similarities and 
differences between groups or types of peo- 
ple, ie., family, authorities, peers, etc. De- 
scription in this case refers to the process of 
labeling dimensions on which one could an- 
ticipate the behavior of these individuals and 
groups. 

In the present research, the “nonrepresenta- 
tive elements group” described only a group 
of friends. These Ss were not concerned with 
describing groups; their sample of people was 
fairly homogeneous. They were concerned, as 
it were, with only one possible subgroup of 
the heterogeneous group used by the “repre- 
sentative elements group.” The construct of 
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“friend versus nonfriend” might well be one 
of the group of dimensions used with the 
heterogeneous group of people. 

In other words, it seems plausible that con- 
sistency occurs when the task requires the use 
of generally superordinate constructs. More 
specifically, it might be hypothesized that the 
number of superordinate constructs produced 
in a concept formation situation is a function 
of the heterogeneity of the objects to be con- 
strued. It may well be that each individual 
has an infinite number of subordinate con- 
structs and hence a great deal of leeway in 
construing within the context of a superordi- 
nate construct; in this case, the context of 
friend versus nonfriend. Such an explanation 
would seem to be consistent with Kelly’s fun- 
damental postulate and the “Organization 
Corrolary.” 


Summary 

The purpose of this study was essentially 
that of testing the differential effects of het- 
erogeneity and homogeneity of role titles on 
the Role Construct Repertory Test. Two 
groups of subjects were administered the test. 
One group utilized a heterogeneous role-title 
list, the other a homogeneous group of ele- 
ments. Significant differences were found in 
terms of the consistency of construct labels. 
In the group using the role-title list, a signifi- 
cant (p< .02) number of subjects demon- 
strated consistency of construct labels through 
time. In the group using homogeneous ele- 
ments, only 20 per cent (p> .20) of the 
subjects demonstrated such consistency. 

It would appear most likely that such re- 
sults are a function of the heterogeneity of 
the role-title list. When presented with a het- 
erogeneous group of stimulus objects on which 
to perform sorts and attach construct labels, 
individuals tend to produce more superordi- 
nate constructs, whereas more subordinate 
constructs are produced when the construed 
objects are homogeneous. 
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This study was designed to determine the 
relationships of the Taylor A Scale to clinical 
criteria of manifest anxiety, and to compare 
these relationships with those between Welsh’s 
Anxiety Index (AI) and the same clinical cri- 
teria. 

The Ss were 44 male members of the U. S. 
Armed Forces referred to a military hospital 
for psychiatric consultation. 

Three psychologists and a psychiatrist in- 
dependently rated each S on a combined check 
list and rating scale of anxiety. The psychia- 
trist completed his ratings after his initial 
diagnostic interview. Then the psychologists 
simultaneously performed their ratings while 
one of them also administered the Wechsler- 
Bellevue Intelligence Scale to the S. Finally, 
the MMPI was administered, which provided 
both the A scale and AI variables. 

The intercorrelations among the psycholo- 
gists in their anxiety ratings ranged between 
.78 and .80. However, none of their ratings 


1 This study is based upon a dissertation submitted 
for the Ph.D. degree, University of Iowa, 1952. The 
writer wishes to express his gratitude to Grant W. 
Dahlstrom, Committee Chairman, and the several 
staff members of the Neuropsychiatric Service, Fitz- 
simmons Army Hospital, who so freely participated 
in obtaining the data. 

2An extended report of this study may be ob- 
tained without charge from Carl G. Lauterbach, 243 
South 80th St., Tacoma, Wash., or for a fee from 
the American Documentation Institute. Order Docu- 
ment No. 5580, remitting $1.25 for microfilm or $1.25 
for photocopies. 


showed a significant relationship to the psy- 
chiatrist’s ratings. Thus, a composite of the 
three psychologists’ ratings was treated as a 
separate criterion of anxiety from the psy- 
chiatrist’s ratings. An item analysis of the 
check list of anxiety signs resulted in four 
signs being selected, assigned weights, and 
summed for each S to form a third criterion 
of anxiety, referred to as the composite 
weighted check list of the psychologists. A 
psychiatric diagnosis formed the fourth cri- 
terion of anxiety. 

The A scale correlated .44 (p < .01) with 
the composite rating of the psychologists and 
38 (p< .05) with the composite weighted 
check list. The AI correlated .36 (p < .05) 
with the composite rating of the psychologists. 
Neither the A scale nor the AI correlated sig- 
nificantly with the psychiatric criteria of 
anxiety. 

The two anxiety test measures and the four 
criteria of anxiety were correlated with the 
three validity scales of the MMPI and with 
the F — K dissimulation index. The A scale 
related significantly to the F, K, and F —K 
scales, while there was a lack of significant 
relationships, in general, between the validity 
scales and the criteria of anxiety. In the case 
of the K scale, it appears that K could be 
used as a variable to suppress the error in the 
A scale to improve the latter’s relation to the 
criterion measures. 


Brief Report. 
Received March 15, 1958. 
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in Clinical Judgment’ 
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Whether he is reading a case history, con- 
ducting a psychological test, or interviewing 
a client, the clinical psychologist has as a 
major objective the formulation of certain 
kinds of judgments about his client. On the 
basis of these judgments, the psychologist de- 
velops his plans for the management of the 
case or for the kinds of recommendations he 
will make in a professional report. Very little 
is known, however, about the variables that 
are operating in making these judgments. 

There appear to be some empirical findings 
to suggest that clinical experience, per se, 
does not insure more accurate clinical judg- 
ments about clients, as shown by Luft (9, 
10), Cline (2), and Soskin (11). Some stud- 
ies show that the method by which clinicians 
acquire information about the client may not 
have to be the traditional interview procedure 
in order for accurate judgments to be made, 
as, for example, the Borke and Fiske study 
(1) finding that clinicians were able to make 
as accurate predictions about neurotic pa- 
tients after reading interview statements as 
they could make after face-to-face interac- 
tions with these patients, and Luft’s report 
that readers of interviews did as well as 
listeners in predicting client responses to ob- 
jective questionnaires. Nor is it clear today 
just how much information a clinician needs 
about a client in order to make valid judg- 


1 This paper is based upon a doctoral dissertation 
completed at the State University of Iowa, August, 
1957. The writer acknowledges the advice and sug- 
gestions of Leonard D. Goodstein, under whose di- 
rection this study was done. Gratitude is also ex- 
pressed to Leonard Feldt for his consultation about 
statistical processing of the data. 

2 Now at the Department of Psychology and the 
Testing and Guidance Bureau, The University of 
Texas. 


ments, or what kinds of data he needs. Stern, 
Stein, and Bloom (12), for example, found 
that increased observational and test data 
tended to be associated with more accurate 
judgments about individuals, but Gage (3) 
reported a lowering of judgmental accuracy 
with increased observational data and Kost- 
lan (7) found that unless a social history was 
included, increased clinical testing data did 
not make for better predictions. 

In an effort to learn more about the vari- 
ables in an interview situation that may play 
a role in influencing the judgments of clini- 
cians about their clients, three variables were 
manipulated in the present study: the voice 
of the interviewee, the content of the inter- 
view statements, and the training and experi- 
ence of the clinician. 


Procedure 


The first counseling interviews of three 
male clients with personal problems who re- 
ported to a university counseling service were 
tape recorded. During this first visit, the 
clients indicated their appraisal of them- 
selves by making Q sorts with Deck 10 from 
Hilden’s Random Sets of Personal Concepts 
(6), by answering Gough’s Adjective Check 
List (4), and a 25-item Self Report Question- 
naire adapted from the Guilford-Martin In- 
ventory of Factors GAMIN (5). After re- 
peating these measures one week later, each 
of the clients was included in the study since 
the reliabilities of the Q sorts were .80 or 
higher, and the percentage of overlap of items 
on the other two measures exceeded 85 per 
cent. 

From each of the clients’ interviews, a ten- 
minute portion of the tape recording was se- 
lected on the basis of giving a general state- 
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ment of the client’s complaints without stress 
on specific etiological material, and an exact 
typescript was prepared. Then, an experienced 
actor re-enacted the interview selection with 
the original counselor in whatever manner the 
actor felt the script indicated, but without the 
actor hearing any recordings of the actual 
client. The same male actor was used to re- 
enact each script. 

Thus, for each of the three clients, any one 
of three sets of data could be supplied to a 
given group of clinical judges: (a) the actual 
tape-recorded selection with the client, (4) 
the tape-recorded selection as re-enacted by 
the actor, or (c) an exact typescript of the 
interview selection. Also, since there were se- 
lections from interviews of three clients, it 
was possible to vary the content of the in- 
terviews. 

Twenty-four male clinical psychologists 
with the Ph.D. degree and affiliation with 
the American Psychological Association were 
used as the Ph.D. judges. Twenty-four male 
trainees in clinical psychology constituted the 
Trainee judges and included either graduate 
students with at least one year of practicum 
experience, or clinical trainees in a hospital 
who had completed at least one year of gradu- 
ate work in psychology. A group of Naive 
judges, 24 male undergraduate students from 
the introductory psychology course, were ob- 
tained from Coe College. The Ph.D. and 
Trainee judges were obtained from three Vet- 
erans Administration hospitals: Downey, IIli- 
nois, Iowa City, and Knoxville, Iowa; from 
the Elgin (Illinois) State Hospital, and from 
the Psychopathic Hospital and the Depart- 
ment of Psychology of the State University 
of Iowa. 

Groups of judges were given various com- 
binations of the voice and content conditions 
and after each presentation were asked to 
predict (actually, postdict) how the client re- 
sponded to the three personality tests. The 
accuracy of any judge’s predictions was meas- 
ured by the number of agreements between 
these predictions and the responses of the 
clients, called “Client Key.” Later, although 
they were instructed to predict the responses 
of the clients and not the impressions of 
other psychologists about the clients, the 
judges’ predictions were scored by an “Expert 
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Key” made from a consensus of three psy- 
chologists as to what the clients’ responses 
“should be” after these psychologists had 
studied, independently, the complete case 
folder of each client. 

Each of the three experience levels of 
judges was subdivided into three subgroups 
of eight judges, and the sequence of pro- 
cedure was so arranged that any one sub- 
group heard a different voice and content 
combination than either of the other two sub- 
groups. Each subgroup, but with different 
voice-content combinations, had the follow- 
ing three conditions: (a) a client speak his 
own interview content, (5) an actor re-enact 
a different one of the three interviews, and 
(c) an exact typescript of the third and dif- 
ferent interview selection. The order of pres- 
entation was confounded with sequence sub- 
groups, so that each of the above conditions 
occurred in each possible order of presenta- 
tion as first, second, or last. By using a Lind- 
quist Type TV analysis of variance design (8, 
pp. 285-288), the voice and the content con- 
ditions were “within” effects, and the experi- 
ence levels were “between”’ effects. 


Results 


Table 1 presents the means and the stand- 
ard deviations of the three variables—experi- 
ence, voice, and content—as measured by the 
Client Key. A summary table of F ratios from 


Table 1 


Means, Standard Deviations of Prediction of Client 
Responses with Variables Manipulated 
in the Study 








Q Sort SR Gough ACL* 


Mean Sigma 


Mean Sigma Mean Sigma 





Experience: 
Naive 
Trainee 
Ph.D. 


14.46 1.51 
15.85 1.22 
16.43 1.17 


37.29 
42.89 
42.39 


18.68 
13.90 
20.87 
Voice: 
Client 344 
Actor 333 
Script 367 


16.39 2.22 
14.60 2.80 
18.75 2.18 


39.67 
38.81 
44.10 


16.73 
14.99 
16.49 


Content: 
#1 328 
#2 366 
#3 350 .099 


15.25 2.44 
14.83 2.93 
16.65 1.68 


$1.72 
33.53 
37.32 


14.92 
12.41 
15.23 





* To reduce 300-item Gough ACL scores to smaller numbers, 
190 was subtracted from total agreements between judge and 
client responses. 
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Table 2 


Summary Table Showing F Values for the Three Measures 








Q Sort 


Key Client Expert 


Self Report Gough ACL 








Client Expert Client Expert 





Between Effects 
Experience 
Voice-Content (b) 
Voice-Cont.-Exp. (b) 


Within Effects 
Voice 1.93 12.41** 
Content 2.35 54.62°* 
Voice-Content (w) 1.87 1.71 
Voice-Experience 1.74 
Content-Experience 1.04 
Voice-Cont.-Exp. (w) 0.36 


953° 
1.02 
0.53 


11.15** 
3.39 
2.61 


0.56 
0.26 
1.68 


13.76** 
1.15 
1.08 


3.95* 
0.67 
0.88 


2.47 
6.98** 
2.09 


0.01 
3.68* 
0.31 


15.02** 

16.55** 
8.69** 
0.79 
0.48 


1.55 


12.47** 

22.50°* 

10.12** 
0.71 
1.84 
0.04 


2.24 
16.93** 
0.72 
0.98 
2.64 
0.51 





* Significant beyond .05 level of confidence. 
** Significant beyond .01 level of confidence. 


the analysis of variance is presented in 
Table 2. 

With the Q sorts, it can be observed that 
the two trained groups of judges were more 
accurate in their predictions than the naive 
group, and this difference was shown by fur- 
ther analysis to be statistically significant 


(p < .01), but the difference between the ac- 
curacy of predictions by Trainee judges and 
Ph.D. judges is not statistically significant. 
The differences between the three voice con- 
ditions are not significant with the Q sorts, 
although the actor’s voice tended to lower the 
accuracy of the predictions about the client’s 
responses. Also, with the Q sorts, when dif- 
ferent interview contents were used, there 
were no statistically significant differences in 
the accuracy of the predictions. 

With the Self Report Questionnaire (SR), 
both groups of trained judges were signifi- 
cantly more accurate in predicting the clients’ 
responses than were the Naive judges (p < 
.01), but again the difference between the 
Trainee judges and the Ph.D. judges was not 
significant. With SR, the three voice condi- 
tions and also the three different contents of 
the interviews influenced the accuracy of the 
judges significantly, as shown in Table 2. Be- 
cause there was a Voice-by-Content interac- 
tion, the effects of voice conditions were com- 
pared over each content separately and it 
was found that the client’s voice produced 
the most accurate predictions for Content 1, 


but no significant voice effects occurred with 
the other content conditions with SR. 

With the Gough Adjective Check List, al- 
though the two trained groups again appear 
to be superior to the Naive judges, statistical 
analysis summarized in Table 2 indicated 
that the differences are not statistically sig- 
nificant. Again, the actor’s voice was associ- 
ated with the poorest predictive accuracy. 
Content 1 was associated with accuracy scores 
strikingly superior to those obtained when 
Content 2 or Content 3 occurred (p < .01). 
Because with Gough ACL a Voice-by-Content 
interaction occurred, an analysis similar to 
that described above with SR data was per- 
formed, and it was observed that the client’s 
voice was associated with significantly lower 
accuracy scores with Content 3, but this did 
not occur with the other contents. 

With the Client Key, the three measuring 
instruments used—Q sorts, SR, and Gough 
ACL—reacted differently to the variables of 
the experiment (Table 1). With the Q sorts 
and the Gough ACL, the script gave the most 
favorable scores for the voice conditions. With 
each of the three personality measures, the 
actor’s voice lowered the accuracy of the pre- 
dictions. In regard to the content of the in- 
terviews, the three personality mez “res re- 
acted differently: Content 1 gave sti..ingly 
superior scores over other contents with the 
Gough ACL, but was the poorest content con- 
dition with Q sorts. Content 2 was associated 
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with the best scores on the Q sorts, but was 
the poorest content condition with the other 
two measures. With SR, Content 3 was associ- 
ated with the highest predictive accuracy. 
Table 3 presents the means and the stand- 
ard deviations of the three variables as scored 
by the Expert Key. The three clinicians who 
composed the key, working independently, 
answered the three personality measures as 
they believed the client “should have re- 
sponded” after they had studied all pertinent 
case history and interview reports as well as 
the actual recorded selections heard by the 
judges in the experiment. The average Q sort 
of the three judges and those SR and Gough 
ACL items on which two or more of the three 
“experts” agreed, constituted the Expert Key. 
As may be seen, the trends in the data are 
similar to those of the Client Key (compare 
Table 1 and Table 3), although now the ac- 
curacy scores are considerably higher. The 
major discrepancy between the results based 
on the two keys occurs in the Q-sort data. 
There, significant voice and content effects 
are revealed with the Expert Key, as shown in 
Table 2. In addition, the “between” compo- 
nent of the Voice-by-Content interaction was 
significant, and the latter effect also occurred 
with the Gough ACL data. Such a “between”’ 
effect may be associated with the dispropor- 
tionate number of State University of Iowa 
judges in the three sequence subgroups. Sub- 
group I contained twelve such judges, whereas 
Subgroup II contained seven, and Subgroup 


Table 3 


Means, Standard Deviations with the Variables 
Manipulated in the Study 
(Expert Key) 








Q Sort SR Gough ACL 


Mean Sigma 





Mean Sigma Mean Sigma 





Experience : 
Naive 
Trainee 
Ph.D. 


45.42 
45.54 
45.89 


Voice : 
Client 
Actor 
Script 


45.67 
43.65 
47.53 


Content: 


#1 J ° 51.57 
#2 ‘ . 41.22 
#3 ‘ J 44.06 
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III contained none. While this effect did not 
influence the predictions of how the clients 
responded, it apparently exerted an influence 
when the judges were scored by an Expert 
Key devised by three State University of 
Iowa psychologists. 

Because of the relatively large differences 
between the average scores under the Client 
and Expert keys, it was considered expedient 
to test the significance of these differences by 
the simple nonparametric sign test (13, pp. 
430-435). The results of this test clearly 
verified the observation that the judges within 
each group show greater agreement with the 
Experts’ predicted responses for the clients 
than with the responses of the clients them- 
selves. 

Discussion 

From the results of this study, it appears 
that judges with one or more years of clini- 
cal experience are able to make more accurate 
predictions about clients’ responses to certain 
personality measures than can be predicted by 
untrained judges. But among judges who have 
had at least one year of training, increased 
experience does not appear to be associated 
with a significant increase in the accuracy of 
predictions. There is the problem of whether 
these results imply that whatever skill is 
added to the individual’s ability to make 
judgments along these lines is acquired rap- 
idly, during the early period of graduate train- 
ing, and that increments thereafter are too 
gradual to make for significant experience 
level differences. The results could be inter- 
preted also to suggest that selection may be 
the important factor here: that certain kinds 
of people, better than average in ability to 
judge the responses of people who are upset, 
enter clinical psychology and that training to 
the level of the Ph.D. does not increase their 
skills along these lines. Another possibility is 
that perhaps the clients used in this study 
placed the more experienced Ph.D. judges at 
a disadvantage, since they work mostly in 
hospital settings, and the clients used in the 
study were outpatients of a university coun- 
seling service. 

In the present study, unlike the Luft studies 
where clinicians were no more accurate than 
nonclinicians, brief interview selections were 
presented to the judges. It could be that the 
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trainee and the experienced clinician are able 
to formulate better personality appraisals on 
the basis of little data than are naive judges. 
Luft’s findings with one-hour interviews that 
experienced clinicians are no more accurate 
than inexperienced judges suggests the possi- 
bility that as the amount of interview mate- 
rial available as a basis for judgment in- 
creases, the superiority of the experienced 
judges over naive judges may diminish. Or, 
there may be differences due to age differ- 
ences, since in the present study, the Naive 
judges were undergraduates. But these specu- 
lations must await empirical testing. 

While it is true that, for all three person- 
ality measures employed in this study, a 
discrepant (the actor’s) voice lowered the ac- 
curacy of clinical judgment, reading the inter- 
view selection allowed more accurate predic- 
tions of Q sorts and Gough ACL responses 
than did listening to the client. This corre- 
sponds to Luft’s report that student readers 
did as well as student listeners in predicting 
responses to objective questionnaires, and 
seems related to Borke and Fiske’s finding 
that reading interview statements was asso- 
ciated with as accurate predictions as oc- 
curred after face-to-face interaction with the 
client. 

In this study, no significant Voice-by-Ex- 
perience or Content-by-Experience interac- 
tions occurred. The experience groups were 
more accurate than the Naive judges over all 
conditions studied, and not just one content 
or one voice. 


Conclusions 


1. Two groups of judges with clinical train- 
ing and experience, the Trainees and the 
Ph.D. judges, predicted client responses more 
accurately than the Naive judges. 

2. Between the experienced groups, how- 
ever, there occurred no statistically significant 
differences in the accuracy of prediction of 
client responses. 

3. A voice discrepant from the original 
speaker of the interview content lowered the 
accuracy of the judgments for all groups of 
judges. 

4. Different interview contents appeared to 
be related to differences in accuracy of judg- 
ments about the client, but the instrument be- 
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ing used appeared to be an important vari- 
able here. 

5. The script condition, with total absence 
of voice cues, was associated with the most 
accurate predictions with two of the three 
measuring instruments used. 

6. There was no statistically significant in- 
teraction between experience and voice, or be- 
tween experience and content. 

7. The judges, regardless of experience 
level, were in greater agreement with what a 
consensus of “experts” predicted about client 
responses than in predicting the actual re- 
sponses made by the clients themselves. 


Received July 17, 1957. 
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A Note on the Reliability of the Leiter 
International Performance Scale 
1948 Revision’ 


Heber C. Sharp 


Utah State University 


Since the Leiter International Performance 
Scale (LIPS) was developed, most of the 
reported research has been designed to ex- 
plore the validity of the LIPS by comparing 
it with other measures of intelligence. Reli- 
ability studies seem to have been neglected. 
Leiter (1) states that two types of reliability 
data are available. The type reported most 
frequently compares the standard deviation 
of the LIPS results with those of the accepted 
instrument. Leiter reports a split-half reli- 
ability coefficient of .89 (1, p. 30). Tate (2), 
following Thorndike’s outline (3, pp. 93-95) 
of Holt’s analysis procedure to estimate reli- 
ability, reports a coefficient of .75. The fol- 
lowing data are concerned with a test-retest 
reliability coefficient. 

Procedure. The LIPS was administered to 
48 retarded school children in September or 
October and again in April or May with at 
least a six-month interval between tests. The 
Ss ranged in age from 8 years and no months 
to 16 years and 5 months. 10 Ss were attend- 
ing a special school for seriously retarded chil- 
dren, 21 were attending special classes for re- 
tarded children, and 17 were attending classes 
with other children in the regular schools. 

Results. Statistical data revealed for the 
initial test an M of 65.1, SD 13.67, and the 
range of 38-88. The retest M was 67.77, SD 
13.73, and the range 41-95. The reliability 
coefficient was found to be .91. A ¢ test to de- 
termine the significance of the difference be- 
tween the means yielded a ¢ of 3.15 (p < .01). 


1This study was supported in part through a 
grant from Utah State University research program. 


Discussion. The retest method may intro- 
duce a spurious factor due to carry-over from 
one administration to the other. Ss frequently 
mentioned memory for one or more test items. 
This might account for the difference found 
between the means. Only 6% of the Ss had 
the same score on both test and retest, 27% 
earned lower scores on the retest, and 67% 
showed increases from 1 to 14 IQ points on 
the retest. For this retarded group, some items 
seemed to be improperly placed in the scale. 
Further empirical work concerned with this 
problem is suggested. 

The reliability coefficient of .91 indicates 
both stability and consistency of the LIPS as 
a measuring instrument for Ss at the lower 
end of the intelligence distribution. The simi- 
larity of the variance measures supports fur- 
ther the reliability of the LIPS. Since the Ss 
were drawn from a restricted range of ability, 
thus limiting the spread of scores, the high 
coefficient does not imply necessarily reliabil- 
ity for all types of subjects. 
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